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ODORANT RECEPTORS AND USES THEREOF 

Background of the Invention 

This application is a continuation-in-part of U.S. Serial 
No. 681,880, filed April 5, 1991, the contents of which are 
hereby incorporated by reference. 

Throughout this application, various publications are 
referenced by Arabic numerals within parentheses. Full 
citations for these publications may be found at the end of 
the specification immediately preceding the claims. The 
disclosures of these publications in their entireties are 
hereby incorporated by reference into this application in 
order to more fully describe the state of the art as known 
to those skilled therein as of the date of the invention 
described and claimed herein. 

In vertebrate sensory systems, peripheral neurons respond to 
environmental stimuli and transmit these signals to higher 
sensory centers in the brain where they are processed to 
allow the discrimination of complex sensory information. 
The delineation of the peripheral mechanisms by which 
environmental stimuli are transduced into neural information 
can provide insight into the logic underlying sensory 
processing. Our understanding of color vision, for example, 
emerged only after the observation that the discrimination 
of hue results from the blending of information from only 
three classes of photoreceptors (1, 2, 3, 4). The basic 
logic underlying olfactory sensory perception, however, has 
remained elusive. Mammals possess an olfactory system of 
enormous discriminatory power (5, 6) . Humans, for example, 
are thought to be capable of distinguishing among thousands 
of distinct odors. The specificity of odor recognition is 
emphasized by the observation that subtle alterations in the 
molecular structure of an odorant can lead to profound 
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changes in perceived odor. 

i 

The detection of chemically distinct odorant presumably 
results from the association of odorous ligands with 

i 

5 specific receptors on olfactory neurons which reside in a 
specialized epithelium in the nose. Since these receptors 
have not been identified, it has been difficult to determine . 
how odor discrimination might be achieved. It is possible 
that olfaction, by analogy with color vision, involves only 

10 a few odor receptors, each capable of interaction with 
multiple odorant molecules. Alternatively, the sense of 
smell may involve a large number of distinct receptors each 
capable of associating with one or a small number of 
odorant. In either case, the brain must distinguish which 

15 receptors or which neurons have been activated to allow the 
discrimination between different odorant stimuli. Insight 
into the mechanisms underlying olfactory perception is 
likeLY to depend upon the isolation of the odorant 
receptors, and the characterization of their diversity, 

20 specificity, and patterns of expression. 

The primary events in odor detection occur in a specialized 
olfactory neuroepithelium located in the posterior recesses 
of the nasal cavity . Three cell types dominate this 

25 epithelium (Figure 1A) : the olfactory sensory neuron, the 
sustentacular or supporting cell, and the basal cell which 
is a stem cell that generates olfactory neurons throughout 
life (7, 8). The olfactory sensory neuron is bipolar: a 
dendritic process extends to the mucosal surface where it 

30 gives rise to a number of specialized cilia which provide an • 
extensive, receptive surface for the interaction of odors 
with olfactory sensory neurons. The olfactory neuron also * 
gives rise to an axon which projects to the olfactory bulb 
of the brain, the first relay in the olfactory system. The 

35 axons of the olfactory bulb neurons, in turn, project to 
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subcortical and cortical regions where higher level 
processing of olfactory information allows the 
discrimination of odors by the brain. 

The initial events in odor discrimination are thought to 
involve the association of odors with specific receptors on 
the cilia of olfactory neurons. Selective removal of the 
cilia results in the loss of olfactory response (9) . 
Moreover, in fish, whose olfactory system senses amino acids 
as odors, the specific binding of amino acids to isolated 
cilia has been demonstrated (10, 11) , The cilia are also 
the site of olfactory signal transduction. Exposure of 
isolated cilia from rat olfactory epithelium to numerous 
odorant leads to the rapid stimulation of adenylyl cyclase 
and elevations in cyclic AMP (an elevation in IP3 in 
response to one odorant has also been observed) (12, 13, 14, 
15) . The activation of adenylyl cyclase is dependent on the 
presence of GTP and is therefore likely to be mediated by 
receptor-coupled GTP binding proteins (G-proteins) (16) . 
Elevations in cyclic AMP, in turn, are thought to elicit 
depolarization of olfactory neurons by direct activation of 
a cyclic nucleotide-gated, cation permeable channel (17, 
18) . This channel is opened upon binding of cyclic 
nucleotides to its cytoplasmic domain, and can therefore 
transduce changes in intracellular levels of cyclic AMP into 
alterations in the membrane potential. 

These observations suggest a pathway for olfactory signal 
transduction (Figure IB) in which the binding of odors to 
specific surface receptors activates specific G-proteins. 
The G-proteins then initiate a cascade of intracellular 
signalling events leading to the generation of an action 
potential which is propagated along the olfactory sensory 
axon to the brain. A number of neurotransmitter and hormone 
receptors which transduce intracellular signals by 
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activation of specific G-proteins have been identified. 
Gene cloning has demonstrated that each of these receptors 
is a member of a large superfamily of surface receptors 
which traverse the membrane seven times (19, 20). The 
5 pathway of olfactory signal transduction (Figure IB) 
predicts that the odorant receptors might also be members of 
this superfamily of receptor proteins. The detection of 
odors in the periphery is therefore likely to involve 
signalling mechanisms shared by other hormone or 

10 neurotransmitter systems, but the vast discriminatory power 
of the olfactory system will require higher order neural 
processing to permit the perception of individual odors. 
This invention address the problem of olfactory perception 
at a molecular level. Eighteen different members of an 

15 extremely large multigene family have been cloned and 
characterized which encodes seven transmembrane domain 
proteins whose expression is restricted to the olfactory 
epithelium. The members of this novel gene family encode 
the individual odorant receptors. 



20 
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fiUMMAKY OF THE INVENTION 

The invention provides an isolated nucleic acid, e.g. a DNA 
and cDNA molecule, encoding an odorant receptor . The 
5 invention further provides expression vectors containing 
such nucleic acid. Also provided by the invention is a 
purified protein encoding an odorant receptor. The 
invention further provides a method of transforming cells 
which comprises transfecting a suitable host cell with a 
10 suitable expression vector containing the nucleic acid 
encoding the odorant receptor. 

The invention also provides methods of identifying odorant 
ligands and of identifying odorant receptors. The invention 

15 further provides methods of developing fragrances, of 
identifying appetite suppressant compounds, of controlling 
appetite. The invention also provides methods of 
controlling insect and other animal populations. The 
invention additionally provides a method of detecting odors 

20 such as the vapors emanating from Cocaine, Marijuana, 
Heroin, Hashish, Angel Dust, gasoline, decayed human flesh, 
alcohol, gun powder explosives, plastic explosives, 
firearms, poisonous or harmful smoke, or natural gas. 



• 



• 
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Description of the Figures 

Figure 1. The Olfactory Neuroepithelium and a Pathway for 
Olfactory Signal Transduction . A. The Olfactory 
Neuroepithelium. The initial event in odor perception 
occurs in the nasal cavity in a specialized neuroepithelium 
which is diagrammed here. Odors are believed to interact 
with specific receptors on the cilia of olfactory sensory 
neurons. The signal generated by these initial binding 
events are propagated by olfactory neuron axons to the 
olfactory bulb. B. A Pathway of Olfactory Signal 
Transduction. In this scheme, the binding of an odorant 
molecule to an odor-specific transmembrane receptor leads to 
the interaction of the receptor with a GTP-binding protein 
(G S j olf j) . This interaction, in turn, leads to the release 
of the GTP-coupled a-subunit of the G-protein, which then 
stimulates adenylyl cyclase to produce elevated levels of 
cAMP. The increase in cAMP opens nucleotide-gated cation 
channels, thus causing an alteration in membrane potential. 

Figure 2. A PCR Amplification Product Containing Multiple 
Species of DNA . cDNA prepared from olfactory epithelium RNA 
was subjected to PCR amplification with a series of 
different primer oligonucleotides and the DNA products of 
appropriate size were isolated, further amplified by PCR, 
and size fractionated on agarose gels (A) (For details, see 
text) . Each of these semipurif ied PCR products was digested 
with the restriction enzyme, Hinf I, and analyzed by agarose 
gel electrophoresis. Lanes marked H M W contain size markers 
of 23.1, 9.4, 5.6, 4.4, 2.3, 2.0, 1.35, 1.08, 0.87, 0.60, 
0.31, 0.28, 0.23, 0.19, 0.12 and 0.07kb. (B) . Twenty- 
two of the 64 PCR products that were isolated and digested 
with Hinf I are shown here. Digestion of one of these, PCR 
13, yielded a large number of fragments whose sizes summed 
to a value much greater than that of the undigested PCR 13 
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DNA, indicating that PGR 13 might contain multiple species 
of DNA which are representatives of a multigene family. 

Figure 3* Northern Blot Analysis with a Mixture of Twenty 
Probes ■ One j*g of polyA+ RNA isolated from rat olfactory 
epithelium, brain, or spleen was size-fractionated in 
formaldehyde agarose, blotted onto a nylon membrane, and 
hybridized with a 32 P-labeled mixture of segments of 20 cDNA 
clones. The DNA segments were obtained by PCR using primers 
homologous to transmembrane domains 2 and 7. 

Figure 4. The Protein Sequences Encoded by Ten Divergent 

cDNA Clones . Ten divergent cDNA clones were subjected to 
DNA sequence analyses and the protein sequence encoded by 
each was determined. Amino acid residues which are 
conserved in 60% or more of the proteins are shaded. The 
presence of seven hydrophobic domains (I-VII) , as well as 
short conserved motifs shared with other members of the 
superfamily, demonstrate that these proteins belong to the 
seven transmembrane domain protein superfamily. Motifs 
conserved among members of the superfamily and the family of 
olfactory proteins include, the GN in TM1 (transmembrane 
domain l), the central W of TM4 f the Y near the c-terminal 
end of TM5 , and the NP in TM7. In addition, the DRY motif 
C-terminal to TM3 is common to many members of the G- 
protein-coupled superfamily. However, all of the proteins 
shown here share sequence motifs not found in other members 
of this superfamily and are clearly members of a novel 
family of proteins. 

Figure 5 . Positions of Greatest Variability in the 
Olfactory Protein Family . in this diagram, the protein 
encoded by cDNA clone 115 is shown traversing the plasma 
membrane seven times with its N-terminus located 
extracellular ly, and its C-terminus intracellularly. The 
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vertical cylinders delineate the seven putative a-helices 
spanning the membrane. Positions at which 60% or more of * 
the 10 clones shown in Figure 4 share the same residue as 
115 are shown as white balls. More variable residues are ; 
5 shown as black balls. The high degree of variability 
encountered in transmembrane domains III, IV, and V is 
evident in this schematic. 

Figure 6 * The Presence of Subfamilies in a Divergent 
Multigene Family . Partial nucleotide sequences and deduced 
protein sequences were obtained for 18 different cDNA 
clones- Transmembrane domain V along with the flanking loop 
sequences, including the entire cytoplasmic loop between 
transmembrane domains V and VI, are shown here for each 
protein. Amino acid residues found in 60% or more of the 
clones in a given position are shaded (A) . This region of 
the olfactory proteins (particularly transmembrane domain V) 
appears to be highly variable (see Figure 4) . These 
proteins, however, can be grouped into subfamilies (B,C,D) 
in which the individual subfamily members share considerable 
homology in this divergent region of the protein. 

Figure 7. Southern Blot Analyses, with Non-crosshvbridizing 
Fragments of Divergent cDKAs . Five /xg of rat liver DNA was 
25 digested with Eco RI (A) or Hind III (B) , electrophoresed in 
0.75% agarose, blotted onto a nylon membrane, and hybridized 
to the 32 P-labeled probes indicated. The probes used were 
PCR-generated fragments of: 1, clone F9 (identical to F12 
in Figure 4); 2, F5; 3, F6; 4, 13; 5, 17; 6, 114; or 7, 115. 
30 The lane labeled 11 1*7" was hybridized to a mixture of the 
seven probes. The probes used showed either no 
crosshybridization or only trace crosshybridization with one 
another. The size markers on the left correspond to the 
four blots on the left (1-4) whereas the marker positions 
35 noted on the right correspond to the four blots on the right 
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(5-7, "1-7"). 



Figure 8 . Northern Blot Analysis with a Mix of Seven 
Divergent Clones . One /ig of polyA+ RNA from each of the 
tissues shown was size- fractionated, blotted onto a nylon 
membrane, and hybridized with a 32 P-labeled mixture of 
segments of seven divergent cDNA clones (see Legend to 
Figure 7) . 



10 



Figure 9. The amino acid and nucleic acid sequence of clone 
F3. 



15 



Figure 10. The amino acid and nucleic acid sequence of 
clone F5. 

Figure 11. The amino acid and nucleic acid sequence of 
clone F6. 



20 



Figure 12. The amino acid and nucleic acid sequence of 
clone F12. 



Figure 13 . The amino acid and nucleic acid sequence of 
clone 13 . 

25 Figure 14 . The amino acid and nucleic acid sequence of 
clone 17. 



30 



Figure 15. The amino acid and nucleic acid sequence of 
clone 18 . 

Figure 16. The amino acid and nucleic acid sequence of 
clone 19. 



Figure 17. 
35 clone 114. 



The amino acid and nucleic acid sequence of 
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Figure 18. The amino acid and nucleic acid sequence of 
clone 115. 

Figure 19. The amino acid and nucleic acid sequence of 
5 human clone H5. 

Figure 20. The amino acid and nucleic acid sequence of 
clone Jl, where the reading frame starts at nucleotide 
position 2. 

10 

Figure 21. The amino acid and nucleic acid sequence of 
clone J2. 

Figure 22. The amino acid and nucleic acid sequence of 
15 clone J4, where the reading frame starts at nucleotide 
position 2 . 

Figure 23. The amino acid and nucleic acid sequence of 
clone J7 , where the reading frame starts at nucleotide 
20 position 2. 

Figure 24. The amino acid and nucleic acid sequence of 
clone J8 , where the reading frame starts at nucleotide 
positon 2. 

25 

Figure 25. The amino acid and nucleic acid sequence of 
clone Jll. 

Figure 26. The amino acid and nucleic acid sequence of 
3 0 clone J14 , where the reading frame starts at nucleotide 
position 2. 

Figure 27 . The amino acid and nucleic acid sequence of 
clone J15, where the reading frame starts at nucleotide 
35 psition 2. 
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Figure 28. The amino acid and nucleic acid sequence of 
clone J16, where the reading frame starts at nucleotide 
position 2 . 

5 Figure 29. The amino acid and nucleic acid sequence of 
clone J17, where the reading frame starts at nucleotide 
position 2. 

Figure 30. The amino acid and nucleic acid sequence of 
10 clone J19, where the reading frame starts at nucleotide 
position 2. 

Figure 31. The amino acid and nucleic acid sequence of 
clone J20 r where the reading frame starts at nucleotide 
15 position 2. 

Figure 32. SOUTHERN BLOT: Five micrograms of DNA isolated 
from 1. Human placenta, 2. NCI-H-1011 neuroblastoma cells, 
or 3. CHP 134 neuroblastoma cells were treated with the 

20 restriction enzyme A. Eco RI, B. Hind III, C. Bam HI, or D. 
Pst I, and then electrophoresed on an agarose gel and 
blotted onto a nylon membrane. The blotted DNA was 
hybridized to the 32P-labeled H3/H5 sequence. An 
autoradiograph of the hybridized blot is shown with the 

25 sizes of co-electrophoresed size markers noted in kilobases. 



i 
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Detailed Description of the Invention 

The invention provides an isolated nucleic acid, e.g. a DNA 
or cDNA molecule, encoding an odorant receptor. Such a 
5 receptor is a receptor which binds an odorant ligand and 
include but not limited to pheromone receptors. An odorant 
ligand may include, but is not limited to, molecules which 
interact with the olfactory sensory neuron, molecules which 
interact with the olfactory cilia, pheromones, and molecules 
10 which interact with structures within the vomeronasal organ. 

The invention specifically provides the isolated cDNAs 
encoding odorant receptors the sequences of which are shown 
in Figures 9-31. The nucleic acid is most typically a cDNA 
15 and encodes an insect, a vertebrate, a fish or a mammalian 
odorant receptor. The mammalian odorant receptor is 
preferably a human, rat, mouse or dog receptor. In an 
embodiment, human odorant receptor cDNA sequence and the 
correspondent protein is isolated (Figure 19) . 

20 

In another embodiment, phermone receptors are isolated and 
shown as clones Jl, J2, J4, J7, J8, Jll, J14, J15, J16, J17, 
J19 and J20 (Figures 20-31) . 

25 The invention further provides expression vectors containing 
cDNA which encodes odorant receptors* Such expression 
vectors are well known in the art and include in addition to 
the nucleic acid the elements necessary for replication and 
expression in a suitable hosts. Suitable hosts are well 

3 0 known in the art and include without limitation bacterial 
hosts such as E. coli, animal hosts such as CHO cells, 
insect cells, yeast cells and like. 

The invention also provides purified proteins encoding 
35 odorant receptors. Such proteins may be prepared by 
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expression of the f orementioned expression vectors in 
suitable host cells and recovery and purification of the 
receptors using methods well known in the art. Examples of 
such proteins include those having the amino acid sequences 
5 shown in figures 9-31. 

The purified protein typically encodes an insect, 
vertebrate, fish or mammalian odorant receptor. The 
mammalian odorant receptor may be a human, rat, mouse or 
10 dog. 

In one embodiment the invention provides a novel purified 
protein which belong to a class of proteins which have 7 
transmembrane regions and a third cytoplasmic loop from the 
15 N-terrainus which is approximately 17 amino acid long and to 
nucleic acid molecules encoding such proteins. 

The invention provides methods of transforming cells which 
comprises transfecting a suitable host cell with a suitable 

20 expression vector containing nucleic acid encoding of the 
odorant receptor. Techniques for carrying out such 
transformations on cells are well known to those skilled in 
the art. (41,42) Additionally, the resulting transformed 
cells are also provided by the invention. These transformed 

25 cells may be either olfactory cells or non-olfactory cells. 
One advantage of using transformed non-olfactory cells is 
that the desired odorant receptor will be the only odorant 
receptor expressed on the cell's surface. 

30 In order to obtain cell lines that express a single receptor 
type, standard procedures may be used to clone individual 
cDNAs or genes into expression vectors and then transfect 
the cloned sequences into mammalian cell lines. This 
approach has been used with sequences encoding some other 

35 members of the seven transmembrane domain superfamily 
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including the 5HTlc serotonin receptor. (43) The cited work 
illustrates how members of this superf amily transferred into 
cell lines may generate immortal cell lines that express 
high levels of the transfected receptor on the cell surface 
5 where it will bind ligand and that such abnormally expressed 
receptor molecules can transduce signals upon binding to 
ligand. 



The invention also provides a method of identifying a 
10 desired odorant ligand which comprises contacting 
transformed non-olfactory cells expressing a known odorant 
receptor with a series of odorant ligands to determining 
which ligands bind to the receptors present on the non- 
olfactory cells. 

15 

Additionally, the invention provides a method of identifying 
a desired odorant receptor comprising contacting a series of 
transformed non-olfactory cells with a known odorant ligand 
and determining which odorant receptor binds with the 
20 odorant ligand. 

The invention provides a method of detecting an odor which 
comprises: a) identifying a odorant receptor which binds 
the desired odorant ligand and; b) imbedding the receptor 
25 in a membrane such that when the odorant ligand binds to the 
receptor so identified a detectable signal is produced. In 
one embodiment of the invention the membrane used in this 
method is cellular, including a membrane of an olfactory 
cell or a synthetic membrane. 

30 

The ligand tested for may be the vapors emanating from 
Cocaine, Marijuana, Heroin, Hashish, Angel Dust, gasoline, 
decayed human flesh, alcohol, gun powder explosives, plastic 
explosives or firearms. In another embodiment the ligand 
35 tested for may be natural gas, a ph romone, toxic fumes, 
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noxious fumes or dangerous fumes. 

In one embodiment of the invention the detectable signal is 
a lightbulb lighting up, a buzzer buzzing, a bell ringing, 
5 a color change, phosphorescence, or radioactivity. 

The invention further provides a method of quantifying the 
amount of an odorant ligand present in a sample which 
comprises utilizing the above-mentioned method for odor 
10 detection and then quantifying the amount of signal 
produced . 

The invention further provides a method of developing 
fragrances which comprises identifying a desired odorant 

15 receptor by the above method, then contacting non-olfactory 
cells, which have been transfected with an expression vector 
containing nucleic acid encoding the desired odorant 
receptor such that the receptor is expressed upon the 
surface of the non-olfactory cell, with a series of 

20 compounds to determine which compound or compounds bind the 
receptor . 

The invention provides to a method of identifying an 
"odorant fingerprint" which comprises contacting a series of 
25 cells, which have been transformed such that each express a 
known odorant receptor, with a desired sample and 
determining the type and quantity of the odorant ligands 
present in the sample. 

30 The invention provides a method of identifying odorant 
ligands which inhibit the activity of a desired odorant 
receptor which comprises contacting the desired odorant 
receptor with a series of compounds and determining which 
compounds inhibit the odorant ligand - odorant receptor 

3 5 interaction. 
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The invention also provides for a method of identifying 
appetite suppressant compounds which comprises identifying 
odorant ligands by the method mentioned in the preceding 
paragraph wherein the desired odorant receptor is that which 
5 is associated with the perception of food. Additionally, 
the invention provides a method of controlling appetite in 
a subject which comprises contacting the olfactory 
epithelium of the subject with these odorant ligands. 
Further the invention provides a nasal spray, to control 
10 appetite comprising the compounds identified by the above 
method in a suitable carrier. 



The invention provides a method of trapping odors which 
comprises contacting a membrane which contains multiples of 
15 the desired odorant receptor, with a sample such that the 
desired odorant ligand is absorbed by the binding of the 
odorant ligand to the odorant receptor. The invention also 
provides an odor trap employing this method. 



20 The invention also provides a method of controlling pest 
populations which comprises identifying odorant ligands by 
the method mentioned above which are alarm odorant ligands 
and spraying the desired area with the identified odorant 
ligands. Additionally, provided by the invention is a 

25 method of controlling a pest population which comprises 
identifying odorant ligands by the above mentioned method, 
which interfere with the interaction between the odorant 
ligands and the odorant receptors which are associated with 
fertility. In one embodiment the pest population is a 

3 0 population of insects or rodents, including mice and rats. 

The invention also provides a method of promoting fertility 
which comprises identifying odorant ligands which interact 
with the odorant receptors associated with fertility by the 
35 above mentioned method. Further, the invention provides a 
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method of inhibiting fertility which comprises employing the 
above mentioned method to identifying odorant ligands which 
inhibit the interaction between the odorant ligands and the 
odorant receptors associated with fertility, 

5 

This invention is illustrated in the Experimental Detail 
section which follow. These sections are set forth to aid 
in an understanding of the invention but are not intended 
to, and should not be construed to, limit in any way the 
10 invention as set forth in the claims which follow 
thereafter. 

EXPERIMENTAL DETAILS 

15 

MATERIALS AND METHODS 

Polymerase Chain Reaction 

20 RNA was prepared from the olfactory epithelia of Sprague 
Dawley rats according to Chirgwin et al. (40) or using 
RNAzol B (Cinna/Biotecx) and then treated with DNase I (0.1 
unit/jig RNA) (Promega) . In order to obtain cDNA, this RNA 
was incubated at 0.1 nq/iil with 5 random hexamers 

25 (Pharmacia) 1 mM each of dATP, dCTP, dGTP, TTP, and 2 
units//U RNase inhibitor (Promega) in 10 mM TrisCl (pH 8.3), 
50 mM KC1, 2.5 mM MgCl 2 , and 0,001% gelatin for 10 min. at 
22 °C, and then for a further 45 min. at 37°C following the 
addition of 20 u./m* of Moloney murine leukemia virus 

30 reverse transcriptase (BRL) . After heating at 95 °C for 3 
min., cDNA prepared from 0.2 /ig of RNA was used in each of 
a series of polymerase chain reactions (PCR) containing 10 
mM TrisCl (pH 8.3), 50 mM KC1, 1.5 mM MgCl 2 , 0.001% gelatin, 
200 jiM each of dATP, dCTP, dGTP, and TTP, 2.5 u. Tag 

35 polymerase (Perkin Elmer Cetus) , and 2 \xYi of each PCR 



WO 92/17585 PCT/US92/02741 



-18- 

primer. PCR reactions were performed according to the 
following schedule: 96*>C for 45 sec, 55°C for 4 min. (or 
AS°C for 2 min.), 72°C for 3 min. with 6 sec. extension per 
cycle for 48 cycles. The primers used for PCR were a series 
5 of degenerate oligonucleotides made according to the amino 
acid seguences found in transmembrane domain 2 and 7 of a 
variety of different members of the 7 transmembrane domain 
protein super family (19) . The regions used correspond to 
amino acids number 60-70 and 286-295 of clone 115 (Figure 
10 4). Each of five different 5* primers were used in PCR 
reactions with each of six different 3' primers. The 5' 
primers had the sequences: 



C AC AC CT 
15 Al, AATTGGATICTIGTIAATCTIGCIGTIGCIGCIGA; 

C C CA A C C 
A2 , AATTATTTTCTIGTIAATCTIGCITTIGCIGA; 

2 0 CCA CC A C 

A3 , AATTTITTTATIATITCICTIGCITGIGCIGA; 

AT C T ACT C 

A4 , CGITTI CTIATGTGTAACCTITGCTTTGCIGA ; 

25 

C CT TG 

A5 , ACIGTITATATIACICATCTIACIATIGCIGA. 

The 3* primers were: 

30 

TTA T CAG CCA 

Bl , CTGICGGTTCATIAAIACATAIATIATIGGGTT; 

TG GA G G A A 

35 B2, GATCGTTIAGACAACAATAI ATI ATI GGGTT; 

AG G A 

B3 , TCI ATGTTAAAI GTI GTATAI ATI ATI GGGTT ; 

40 T G G A A 

B4 , GCCTTI GTAAAIATIGCATAIAGGAAI GGGTT ; 

G AGA G G G A 

B5 , AAATCIGGGCTICGICAATAIATCAIIGGGTT; 

45 



CT CT G G G G A 
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B6, GAIGAICCIACAAAAAAATAIATAAAIGGGTT. 

An aliquot of each PCR reaction was analyzed by agarose gel 
5 electrophoresis and bands of interest were amplified further 
by performing PCR reactions on pipet tip (approx. 1 ^1) 
plugs of the agarose gels containing those DNAs. Aliguots 
of these semi-purified PCR products were digested with the 
restriction enzymes Hae III or Hinf I and the digestion 
10 products were compared with the undigested DNAs on agarose 
gels* 

Isolation and Analysis of cDNA Clones 

15 CDNA libraries were prepared according to standard 
procedures (41, 42) in the cloning vector, XZAP II 
(Stratagene) using poly A + RNA prepared from Sprague Dawley 
rat epithelia (see above) or from an enriched population of 
olfactory neurons which had been obtained by a 'panning' 

20 procedure, using an antibody against the H blood group 
antigen (Chembiomed) found on a large percentage of rat 
olfactory neurons. In initial library screens, 8.5 X 10 5 
independent clones from the olfactory neuron library and 1.8 
X 10 6 clones from the olfactory epithelium library were 

25 screened (41) with a 32 P-labeled probe (prime-it, 
Stratagene) consisting of a pool of gel-isolated PCR 
products obtained using primers A4 and B6 (see above) in PCR 
reactions using as template, olfactory epithelium cDNA, rat 
liver DNA, or DNA prepared from the two cDNA libraries. In 

30 later library screens, a mixture of PCR products obtained 
from 20 cDNA clones with the A4 and B6 primers was used as 
probe ('PI 1 probe). In initial screens, phage clones were 
analyzed by PCR using primers A4 and B6 and those which 
showed the appropriate size species were purified. In later 

3 5 screens, all position clones were purified, but only those 
that could be amplified with the B6 primer and a primer 
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specific for vector sequence were analyzed further. To 
obtain plasmids from the isolated phage clones, phagemid 
rescue was performed according to the instructions of the 
manufacturer of A. ZAP II (Stratagene) . DNA sequence analysis 
5 was performed on plasmid DNAs using the Sequenase system 
(USB) , initially with the A4 and B6 primers and later with 
oligonucleotide primers made according to sequences already 
obtained . 

10 Northern and Southern Blot Analyses 

For Northern blots, poly A + RNAs from various tissues were 
prepared as described above or purchased from Clontech. One 
^g of each RNA was size fractionated on formaldehyde agarose 

15 gels and blotted onto nylon membranes (41, 42) . For 
Southern blots, genomic DNA prepared from Sprague Dawley rat 
liver was digested with the restriction enzymes Eco RI or 
Hind III, size fractionated on agarose gels and blotted onto 
nylon membranes (41, 42). The membranes were dried at 80°C, 

2 0 and then prehybridized in 0.5 M sodium phosphate buffer (pH 
7.3) containing 1% bovine serum albumin and 4% sodium 
dodecyl sulfate. Hybridization was carried out in the same 
buffer at 65°-70°C for 14-20 hrs. with DNAs labeled with 
32 P. For the first Northern blot shown, the 'PI ' probe (see 

25 above under cDNA clone isolation) was used. For the second 
Northern blot shown, a mix of PCR fragments from seven 
divergent cDNA clones was used. For southern blots, the 
region indicated in clone 115 by amino acids 118 through 251 
was amplified from a series of divergent cDNA clones using 

30 PCR. The primers used for these reactions had the 
sequences : 

PI, ATGGCITATGATCGITATGTIGC, and 
35 P4, AAIAGIGAIACIATIGAIAGATGIGAICC 
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These DNAs (or a DNA encompassing transmembrane domains 2 
through 7 for clone F6) were labeled and tested for 
crosshybridization at 70°C. Those DNAs which did not show 
appreciable crosshybridization were hybridized individually, 
5 or as a pool to Southern blots at 70°C. 

Rat Sequences used to obtain, similar sequences expressed in 
Humans 

10 There are genes similar to the rat genes discussed above 
present in humans, these genes may be readily isolated by 
screening human gene libraries with the cloned rate 
sequences or by performing PCR experiments on human genomic 
DNA with primers homologous to the rat sequences. First, 

15 PCR experiments were performed with genomic DNA from rat, 
human, mouse, and several other species. When primers 
homologous to transmembrane domains 2 and 6 (the A4/B6 
primer set used to isolate the original rat sequences) were 
used, DNA of the appropriate size was amplified from rat, 

20 human and mouse DNAs. When these primary PCR reactions were 
subsequently diluted and subjected to PCR using primers to 
internal sequences (PI and P4 primers) , smaller DNA species 
were amplified whose size was that seen when the same 
primers were used in PCR reactions with the cloned rat 

25 cDNAs. Similarly, when the secondary PCR was performed with 
one outer primer together with one inner primer (ie. A4/P4 
or P1/B6) , amplified DNAs were obtained whose sizes were 
also consistent with the amplification of genes similar in 
sequence and organization to the cloned rat cDNAs. Second, 

30 a mix of segments from 20 of the rat cDNAs ('PI" probe) was 
used to screen libraries constructed from human genomic 
DNAs . Hybridization under high or low stringency conditions 
reveals the presence of a large number of cloned human DNA 
segments that are homologous to the rat sequences. 

3 5 Finally, RNA from a human olfactory tumor (neuroesthesioma, 
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NCI-H-1011) cell line has been examined for sequences 
homologous to those cloned in the rat. cDNA prepared from 
this RNA was subjected to PCR with the A4/B6 primer set and 
a DNA species of the appropriate size was seen. This DNA 
5 was subcloned and partially sequenced and clearly encodes a 
member of the olfactory protein family identified in the 
rat. 

The inserted sequence in human clones H3/H5 was amplified by 
10 PCR with the A4/B6 primers, gel purified, and then labeled 
with 32P. The labeled DNA was then hybridized to 
restriction enzyme human placenta. Multiple hybridizing 
species were observed with each DNA (See Figure 32) . This 
observation is consistent with the presence of a family of 
15 odorant receptor genes in the human genome. 

The sequence of clone H5 is hereby shown in Figure 19. In 
addition, the translated protein sequence is shown in Figure 
19. 

20 

In other to identify odorant receptors in other species, 
degenerated primer oligonucleotides homologous to conserved 
regions within the rat odorant receptor family may be used 
in PCR reactions with genomic DNA or with cDNA prepared from 
25 olfactory tissue RNA from those species. 

RESULTS 

Cloning the Gene Family 

30 A series of degenerate oligonucleotides were designated 
which could anneal to conserved regions of members of the 
superfamily of G-protein coupled seven transmembrane domain 
receptor genes. Five degenerate oliogonucleotides (Al-5; 
see Experimental Procedures) matching sequences within 

35 transmembrane domain 2, and six degenerate oligonucleotides 
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(Bl-6) matching transmembrane domain 7 were used in all 
combinations in PCR reactions to amplify homologous 
sequences in cDNA prepared from rat olfactory epithelium 
RNA. The amplification products of each PCR reaction were 
5 then analyzed by agarose gel electrophoresis. Multiple 
bands were observed with each of the primer combinations. 
The PCR products within the size range expected for this 
family of receptors (600 to 1300 bp) were subsequently 
picked and amplified further with the appropriate primer 
10 pair in order to isolate individual PCR bands. Sixty-four 
PCR bands isolated in this fashion revealed only one or a 
small number of bands upon agarose gel electrophoresis. 
Representatives of these isolated PCR products are shown in 
Figure 2A. 

15 

The isolated PCR products were digested with the 
endonuclease , Hae III or Hinf I, which recognize four base 
restriction sites and cut DNA at frequent intervals. In 
most instances, digestion of the PCR product with Hinf I 

20 generated a set of fragments whose molecular weights sum to 
the size of the original DNA (Figure 2B) . These PCR bands 
are therefore likely to each contain a single DNA species. 
In some cases, however, restriction digestion yielded a 
series of fragments whose molecular weights sum to a value 

25 greater than that of the original PCR product. The most 
dramatic example is shown in Figure 2 where the 710 bp, PCR 
13 DNA, is cleaved by Hinf I to yield a very large number of 
restriction fragments whose sizes sum to a value five- to 
ten-fold greater than that of the original PCR product. 

30 These observations indicated that PCR product 13 consists of 
a number of different species of DNA, each of which could be 
amplified with the same pair of primer oligonucleotides. In 
addition, when PCR experiments similar to those described 
were performed using cDNA library DNAs as templates, a 710 

35 bp PCR product was obtained with the PCR13 primer pair 
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(A4/B6) with DNA from olfactory cDNA libraries, but not a 
glioma cDNA library. Moreover, digestion of one of this 710 
bp product also revealed the presence of multiple DNA 
species. In other cases (see PCR product 20, for example) , 
digestion yielded a series of restriction fragments whose 
molecular weights also sum to a size greater than the 
starting material. Further analysis, however, revealed that 
the original PCR product consisted of multiple bands of 
similar but different si2es. 

In order to determine whether the multiple DNA species 
present in PCR 13 encode members of a family of seven 
transmembrane domain proteins, PCR 13 DNA was cloned into 
the plasmid vector Bluescript and five individual clones 
were subjected to DNA sequence analysis. Each of the five 
clones exhibited a different DNA sequence, but each encoded 
a protein which displayed conserved features of the 
superfamily of seven transmembrane domain receptor proteins. 
In addition, the proteins encoded by all five clones shared 
distinctive sequence motifs not found in other superfamily 
members indicating they were all members of a new family of 
receptors. 

To obtain full-length cDNA clones, cDNA libraries prepared 
from olfactory epithelium RNA or from RNA of an enriched 
population of olfactory sensory neurons were screened. The 
probe used in these initial screens was a mixture of PCR 13 
DNA as well as DNA obtained by amplication of rat genomic 
DNA or DNA from two olfactory cDNA libraries with the same 
primers used to generate PCR 13 (A4 and B6 primers) . 
Hybridizing plaques were subjected to PCR amplification with 
the A4/B6 primer set and only those giving a PCR product of 
the appropriate size (approximately 710 bp) were purified. 
The frequency of such positive clones in the enriched 
olfactory neuron cDNA library was approximately five times 
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greater than the frequency in the olfactory epithelium cDNA 
library* The increased frequency of positive clones 
observed in the olfactory neuron library is comparable to 
the enrichment in olfactory neurons generally obtained in 
5 the purification procedure. 

The original pair of primers used to amplify PCR 13 DNA were 
then used to amplify coding segments of 20 different cDNA 
clones. A mix of these PCR products were labeled and used 

10 as probe for further cDNA library screens. This mixed probe 
was also used in a Northern blot (Figure 3) to determine 
whether the expression of the gene family is restricted to 
the olfactory epithelium. The mixed probe detects two 
diffuse bands centered at 2 and 5 kb in RNA from olfactory 

15 epithelium; no hybridization can be detected in brain or 
spleen. (Later experiments which examined a larger number 
of tissue RNAs with a more restricted probe will be shown 
below.) Taken together, these data indicate the discovery 
of a novel multigene family encoding seven transmembrane 

20 domain proteins which are expressed in olfactory epithelium, 
and could be expressed predominantly or exclusively in 
olfactory neurons. 

The Protein Sequences of Numerous, Olf actorv-specif ic 
25 Members of the Seven Transmembrane Domain Super family 

Numerous clones were obtained upon screening cDNA libraries 
constructed from olfactory epithelium and olfactory neuron 
RNA at high stringency. Partial DNA sequences were obtained 

30 from 36 clones; 18 of these cDNA clones are different, but 
all of them encode proteins which exhibit shared sequence 
motifs indicating that they are members of the family 
identified in PCR 13 DNA. A complete nucleotide seguence 
was determined for coding regions of ten of the most 

35 divergent clones (Figure 4). Th deduced protein sequences 
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of these cDKAs defines a new multigene family which shares 
sequence and structural properties with the superf amily of 
neurotransmitter and hormone receptors that traverse the 
membrane seven times. This novel family, however, exhibits 
5 features different from any other member of the receptor 
superf amily thus far identified. 

Each of the ten sequences contains seven hydrophobic 
stretches (19-26 amino acids) that represent potential 

10 transmembrane domains. These domains constitute the regions 
of maximal sequence similarity to other members of the seven 
transmembrane domain superf amily (see legend to Figure 4). 
On the basis of structural homologies with rhodopsin and the 
B-adrenergic receptors, (19) it is likely that the amino 

15 termini of the olfactory proteins are located on the 
extracellular side of the plasma membrane and the carboxyl 
termini are located in the cytoplasm. In this scheme, three 
extracellular loops alternate with three intracellular loops 
to link the seven transmembrane domains (see Figure 5) . 

20 Analysis of the sequences in figure 4 demonstrates that the 
olfactory proteins, like other members of the receptor 
superf amily, display no evidence of an N-terminal signal 
sequence. As in several other superf amily members, a 
potential N-linked glycosylation site is present in all ten 

25 proteins within the short N-terminal extracellular segment. 
Other structural features conserved with previously 
identified members of the superf amily included cysteine 
residues at fixed positions within the first and second 
extracellular loops that are thought to form a disulfide 

30 bond. Finally, many of the olfactory proteins reveal a 
conserved cysteine within the C-terminal domain which may 
serve as a palmitoylation site anchoring this domain to the 
membrane (21). These features, taken together with several 
short, conserved sequence motifs (see legend to Figure 4), 

35 clearly define this new family as a member of the 
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superfamily of genes encoding the seven transmembrane domain 
receptors . 

There are, however, important differences between the 
5 olfactory protein family and the other seven transmembrane 
domain proteins described previously and these differences 
may be relevant to proposed function of these proteins in 
odor recognition. Structure-function experiments involving 
in vitro mutagenesis suggest that adrenergic ligands 

10 interact with this class of receptor molecule by binding 
within the plane of the membrane (22, 20). Not 
surprisingly, small receptor families that bind the same 
class of ligands, such as the adrenergic and muscarinic 
acetylcholine receptor families exhibit maximum sequence 

15 conservation (often over 80%) within the transmembrane 
domains. In contrast, the family of receptors discussed in 
this application shows striking divergence within the third, 
fourth, and fifth transmembrane domains (Figure 4) . The 
variability in the three central transmembrane domains is 

20 highlighted schematically in Figure 5. The divergence in 
potential ligand binding domains is consistent with the idea 
that the family of molecules cloned is capable of 
associating with a large number of odorant of diverse 
molecular structure. 

25 

Receptors which belong to the superfamily of seven 
transmembrane domain proteins interact with G-proteins to 
generate intracellular signals. In vitro mutagenesis 
experiments indicate that one site of association between 

30 receptor and G-protein resides within the third cytoplasmic 
loop (22, 23). The sequence of this cytoplasmic loop in 18 
different clones we have characterized is shown in Figure 
6A. This loop which is often quite long and of variable 
length in the receptor superfamily is relatively short (only 

35 17 amino acids) and of fixed length in the 18 clones 
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examined. Eleven of the 18 different clones exhibit the 
sequence motif K/R I V S S I (or a close relative) at the N- 
terminus of this loop. Two of the cDNA clones reveal a 
different HIT C/W A V motif at this site. If this short 
5 loop is a site of contact with G-proteins, it is possible 
that the conserved motifs may reflect sites of interaction 
with different G-proteins that activate different 
intracellular signalling systems in response to odors. In 
addition, the receptors cloned reveal several serine or 
10 threonine residues within the third cytoplasmic loop. By 
analogy with other G-protein coupled receptors, these 
residues may represent sites of phosphorylation for specific 
receptor kinases involved in desensitization. (24) 

15 Subfamilies within the Multigene Family 

Figure 6A displays the sequences of the fifth transmembrane 
domain and the adjacent cytoplasmic loop encoded by L8 of 
the cDNA clones we have analyzed. As a group, the 18 

20 sequences exhibit considerable divergence within this 
region. The multigene family, however, can be divided into 
subfamilies such that the members of a given subfamily share 
significant sequence conservation. In subfamily B, clones 
F12 and F13, for example, differ from one another at only 

25 four of 44 positions (91% identify) , and clearly define a 
subfamily. Clones F5 and 111 (subfamily D) differ from F12 
and F13 at 34-36 positions within this region and clearly 
define a separate subfamily. Thus, this olfactory-specific 
multigene family consists of highly divergent subfamilies. 

30 If these genes encode odor receptors, it is possible that 
members of the divergent subfamilies bind odor ant of widely 
differing structural classes. Members of the individual 
subfamilies could therefore recognize more subtle 
differences between molecules which belong to the same 

35 structural class of molecules structures. 
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The Size of the Multiqene Family 

Genomic Southern blotting experiments were preformed and 
genomic libraries were screened to obtain an estimate of the 
5 sizes of the multigene family and the member subfamilies 
encoding the putative odor receptors. DNAs extending from 
the 3 1 end of transmembrane domain 3 to the middle of 
transmembrane domain 6 were synthesized by PCR from DNA of 
seven of the divergent cDNA clones (Figure 4). In initial 

10 experiments, these DNAs were labeled and hybridized to each 
other to define conditions under which minimal 
crosshybridization would be observed among the individual 
clones. At 70 °C, the seven DNAs showed no 

crosshybridization, or crosshybridized only very slightly. 

15 The trace levels of crosshybridization observed are not 
likely to be apparent upon genomic Southern blot analysis 
where the amounts of DNA are far lower than in the test 
cross. 

20 Probes derived from these seven DNAs were annealed under 
stringent conditions, either individually or as a group, to 
Southern blots of rat liver DNA digested with the 
restriction endonucleases Eco RI or Hind III (Figure 7) . 
Examination of the Southern blots reveals that all but one 

25 of the cDNAs detects a relatively large, distinctive array 
of bands in genomic DNA. Clone 115 (probe 7) , for example, 
detects about 17 bands with each restriction endonuclease, 
whereas clone F9 (probe l) detects only about 5-7 bands with 
each enzyme. A single band is obtained with clone 17 (probe 

30 5). PCR experiments using nested primers (TM2/TM7 primers 
followed by primers to internal sequences) and genomic DNA 
as template indicate that the coding regions of the members 
of this multigene family, like those of many members of the 
G-protein coupled superfamily, may not be interrupted by 

35 introns. This observation, together with the fact that most 
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of the probes only encompasses 400 nucleotides suggests that 
each band observed in these experiments is likely to 
represent a different gene. These data suggest that the 
individual probes chosen are representatives of subfamilies 
5 which range in size from a single member to as many as 17 
members. A total of about 70 individual bands were detected 
in this analysis which could represent the presence of at 
least 70 different genes. Although the DNA probes used in 
these blots did not crosshybridize appreciably with each 

10 other, it is possible that a given gene might hybridize to 
more than one probe, resulting in an overestimate of gene 
number. However, it is probable that the total number of 
bands only reflects a minimal estimate of gene number since 
it is unlikely that we have isolated representative cDNAs 

15 from all of the potential subfamilies and the hybridizations 
were performed under conditions of very high stringency. 

A more accurate estimate of the size of the olfactory- 
specific gene family was obtained by screening rat genomic 

20 libraries. The mix of the seven divergent probes used in 
Southern blots, or the mix of 20 different probes used in 
our initial Northern blots (see Figure 3), were used as 
hybridization probes under high (€5°C) or lowered (55°C) 
stringency conditions in these experiments. Nested PGR (see 

25 above) was used to verify that the clones giving a positive 
signal under low stringency annealing conditions were indeed 
members of this gene family. It is estimated from these 
studies that there are between 100 and 200 positive clones 
per haploid genome. The estimate of the size of the family 

30 obtain from screens of genomic libraries again represents a 
lower limit. Given the size of the multigene family, one 
might anticipate that many of these genes are linked such 
that a given genomic clone may contain multiple genes. Thus 
the data from Southern blotting and screens of genomic 

35 libraries indicate that the multigene family identified 
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consists of one to several hundred member genes which can be 
divided into multiple subfamilies. 

It should be noted that the cDNA probes isolated may not be 
5 representative of the full complement of subfamilies within 
the larger family of olfactory proteins. The isolation of 
cDNAs, for example, relies heavily on PCR with primers from 
transmembrane domains 2 and 7 and biases our clones for 
homology within these regions. Thus, estimates of gene 
10 number as well as subsequent estimates of RNA abundance 
should be considered as minimal. 



Expression of the Members of this Multiqene Family 



15 Additional Northern blot analyses were preformed to 
demonstrate that expression of the members of this gene 
family is restricted to the olfactory epithelium. (Figure 8) 
Northern blot analysis with a mixed probe consisting of the 
seven divergent cDNAs used above reveals two diffuse bands 

20 about 5 and 2 kb in length in olfactory epithelium RNA. 
This pattern is the same as that seen previously with the 
mix of 20 DNAs. No annealing is observed to RNA from the 
brain or retina or other, nonneural tissues, including lung, 
liver, spleen, and kidney. 

25 

An estimate of the level of expression of this family can be 
obtained from screens of cDNA libraries. The frequency of 
positive clones in cDNA libraries made from olfactory 
epithelium RNA suggests that the abundance of the RNAs in 

30 the epithelium is about one in 20,000. The frequency of 
positive clones is approximately five-fold higher in a cDNA 
library prepared from RNA from purified olfactory neurons 
(in which 75% of the cells are olfactory neurons) . The 
increased frequency of positive clones obtained in the 

35 olfactory neuron cDNA library is comparable to the 
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enrichment we obtain upon purification of olfactory neurons. 
These observations suggest that this multigene family is 
expressed largely, if not solely, in olfactory neurons and 
may not be expressed in other cell types within the 
5 epithelium. If each olfactory neuron contains 10 5 mRNA 
molecules, from the frequency of positive clones we predict 
that each neuron contains only 25-30 transcripts derived 
from this gene family. Since the family of olfactory 
proteins consists of a minimum of a hundred genes, a given 
10 olfactory neuron could maximally express only a proportion 
of the many different family members. These values thus 
suggest that olfactory neurons will exhibit significant 
diversity at the level of expression of these olfactory 
proteins. 

15 

Identification of pheromone receptors in vomeronasal organ 
The vomeronasal organ (vomeronasal gland) is an accessory 
olfactory structure that is located near the nasal cavity. 
Like the olfactory epithelium of the nasal cavity, the 

20 olfactory epithelium of the vomeronasal organ contains 
olfactory sensory neurons. The vomeronasal organ is 
believed to play an important role in the sensing of 
pheromones in numerous species. Pheromones are believed to 
have profound effects on both physiological and behavioral 

25 aspects of reproduction. the identification of pheromone 
receptors would permit the identification of the pheromones 
themselves. It would also enable one to identify agonists or 
antagonists that would either mimic the pheromones or block 
the pheromone receptors from transducing pheromone signals. 

3 0 Such information would be important to the development of 
species specific pesticides and, conversely, to animal 
husbandry. The identification of pheromone receptors in 
human could ultimately lead to the development of 
contraceptives or to treatments for infertility in humans. 

35 It is likely that the identification of pheromone receptors 
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in low mammals such as rodents would lead to the 
identification of similar receptors in human. 

In order to identify potential pheromone receptors, we 
5 isolate RKA from the vomeronasal organs of female rats and 
prepared cDNA from this RNA. The cDNA was subjected to PCR 
with several different pairs of degenerate oligonucleotide 
primers that match sequences present in the rat odorant 
receptor family. The PCR products were subcloned and the 

10 nucleotide sequences of the subcloned DNAs were determined. 
Each of the subcloned DNAs encodes a protein that belongs to 
the odorant receptor family. The sequences of the following 
vomeronasal subclones are shown: Jl, J2, J4, J7, J8, Jll, 
J14, J15, J16, J17 ( J19, J20. In a few cases (J2, J4) f the 

15 same sequence was amplified with two different primer pairs 
and the sequence shown is a composite of the two sequences. 
It is possible that one or more of these molecules, or 
closely related molecules, serve as pheromone receptors in 
the rat. 

20 

DISCUSSION 

The mammalian olfactory system can recognize and 
discriminate a large number of odorous molecules, 

25 Perception in this system, as in other sensory systems, 
initially involves the recognition of external stimuli by 
primary sensory neurons. This sensory information is then 
transmitted to the brain where it is decoded to permit the 
discrimination of different odors. Elucidation of the logic 

3 0 underlying olfactory perception is likely to require the 
identification of the specific odorant receptors, the 
analysis of the extent of receptor diversity and receptor 
specificity, as well as an understanding of the pattern of 
receptor expression in the olfactory epithelium. 

35 
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The odorant receptors are thought to transduce intracellular 
signals by interacting with G-proteins which activate second 
messenger systems (12, 13, 14, 15). These proteins are 
clearly members of the family of G-protein coupled receptors 
5 which traverse the membrane seven times (19). The odorant 
receptors should be expressed specifically in the tissue in 
which odorant are recognized. The family of olfactory 
proteins cloned is expressed in the olfactory epithelium. 
Hybridizing RNA is not detected in brain or retina, or in a 

10 host of nonneural tissues. Moreover, expression of this 
gene family the epithelium may be restricted to olfactory 
neurons. The family of odorant receptors must be capable of 
interacting with extremely diverse molecular structures. 
The genes cloned are members of any extremely large 

15 multigene family which exhibit variability in regions 
thought to be important in ligand binding. The possibility 
that each member of this large family of seven transmembrane 
proteins is capable of interacting with only one or a small 
number of odorant provides a plausible mechanism to 

20 accommodate the diversity of odor perception. The 
properties of the gene family identified suggests that this 
family is likely to encode a large number of distinct 
odorant receptors. 

25 Size of the Multigene Family 

The size of the receptor repertoire is likely to reflect the 
range of detectable odors and the degree of structural 
specificity exhibited by the individual receptors. It has 

30 been estimated that humans can identify over 10,000 
structurally-distinct odorous ligands. However, this does 
not necessarily imply that humans possess an equally large 
repertoire of odorant receptors. For example, binding 
studies in lower vertebrates suggest that structurally- 

35 related odorant may activate the same receptor molecules. 
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In fish which smell amino acids, the binding of alanine to 
isolated cilia can be competed by other small polar residues 
(threonine and serine) , but not by the basic amino acids, 
lysine or arginine (11) . These data suggest that individual 
5 receptors are capable of associating with several 
structurally-related ligands, albeit with different 
affinities. Stereochemical models of olfactory recognition 
in mammals (25) (based largely on psychophysical, rather 
than biophysical data) have suggested existence of several 

10 primary odor groups including camphoraceous, musky, 
peppermint, ethereal, pungent, and putrid. In such a model, 
each group would contain odor ant with common molecular 
configurations which bind to common receptors and share 
similar odor qualities. 

15 

Screens of genomic libraries with mixed probes consisting of 
divergent family members detect approximately 100 to 200 
positive clones per genome. The present estimate of at 
least 100 genes provides only a lower limit since it is 

20 likely that the probes used do not detect all of the 
possible subfamilies. Moreover, it is probable that many of 
these genes are linked such that a given genomic clone may 
contain multiple genes. It is therefore expected that the 
actual size of the gene family may be considerably higher 

25 and this family of putative odor ant receptors could 
constitute one of the largest gene families in the genome. 

The character Nation of a large multigene family encoding 
putative odorant receptors suggests that the olfactory 

3 0 system utilizes a far greater number of receptors than the 
visual system. Color vision, for example, allows the 
discrimination of several hundred hues, but is accomplished 
by only three different photoreceptors (1, 2, 3 and 4) . The 
photoreceptors each have different, but overlapping 

35 absorption spectra which cover the entire spectrum of 
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visible wavelengths. Discrimination of color results from 
comparative processing of the information from these three 
classes of photoreceptors in the brain. Whereas three 
photoreceptors can absorb light across the entire visible 
5 spectrum, our data suggest that a small number of odorant 
receptors cannot recognize and discriminate the full 
spectrum of distinct molecular structures perceived by the 
mammalian olfactory system. Rather, olfactory perception 
probably employs an extremely large number of receptors each 
10 capable of recognizing a small number of odorous ligands. 

Diversity within the Gene Family and the Specificity of Odor 
Recognition 

15 The olfactory proteins identified in this application are 
clearly members of the superfamily of receptors which 
traverse the membrane seven time. Analysis of the proteins 
encoded by the 18 distinct cDNAs we have cloned reveals 
structural features which may render this family 

20 particularly well suited for the detection of a diverse 
array of structurally distinct odorant. Experiments with 
other members of this class of receptors suggest that the 
ligand binds to its receptor within the plane of the 
membrane such that the ligand contacts many, if not all of 

25 the transmembrane helices. The family of olfactory proteins 
can be divided into several different subfamilies which 
exhibit significant sequence divergence within the 
transmembrane domains. Nonconservative changes are commonly 
observed within blocks of residues in transmembrane regions 

30 3, 4, and 5 (Figures A f 5, 6); these blocks could reflect 
the sites of direct contact with odorous ligands. Some 
members, for example, have acidic residues in transmembrane 
domain 3, which in other families are thought to be 
essential for binding aminergic ligands (20) while other 

35 members maintain hydrophobic residues at these positions. 
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This divergence within transmembrane domains may reflect the 
fact that the members of the family of odorant receptors 
must associate with odorant of widely different molecular 
structures . 

5 

These observations suggest a model in which each of the 
individual subfamilies encode receptors which bind distinct 
structural classes of odorant. Within a given subfamily, 
however, the sequence differences are far less dramatic and 

10 are often restricted to a small number of residues. Thus, 
the members of a subfamily may recognize more subtle 
variations among odor molecules of a given structural class. 
At a practical level, individual subfamilies may recognize 
grossly different structures such that one subfamily may 

15 associate, for example, with the aromatic compound, benzene 
and its derivatives, whereas a second subfamily may 
recognize odorous, short chain, aliphatic molecules. Subtle 
variations in the structure of the receptors within, for 
example, the hypothetical benzene subfamily could facilitate 

20 the recognition and discrimination of various substituted 
derivatives such as toluene, xylene or phenol. It should be 
noted that such a model, unlike previous stereochemical 
models, does not necessarily predict that molecules with 
similar structures will have similar odors. The activation 

25 of distinct receptors with similar structures could elicit 
different odors, since perceived odor will depend upon 
higher order processing of primary sensory information. 

Evolution of the Gene Family and the Generation of Diversity 

30 

Preliminary evidence from PCR analyses suggests that members 
of this family of olfactory proteins are conserved in lower 
vertebrates as well as invertebrates. This gene family 
presumably expanded over evolutionary time providing mammals 
35 with the ability to recognize an increasing diversity of 
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odorant. Examination of the sequences of the family members 
cloned from mammals provides some insight into the evolution 
of this multigene family. Although the chromosomal loci 
encoding these genes has yet to be characterized, it is 
5 likely that at least some member genes will be tandem ly 
arranged in a large cluster as is observed with other large 
multigene families. A tandem array of this sort provides a 
template for recombination events including unequal crossing 
over and gene conversion, that can lead to expansion and 
10 further diversification of the sort apparent among the 
family members we have cloned (26). 

The multigene family encoding the olfactory proteins is 
large: all of the member genes clearly have a common 

15 ancestral origin, but have undergone considerable divergence 
such that individual genes encode proteins that share from 
40-80% amino acid identity. Subfamilies are apparent with 
groups of genes sharing greater homology among themselves 
than with members of other subfamilies. Examination of the 

20 sequences of even the most divergent subfamilies, however, 
reveals a pattern in which several blocks of conserved 
residues are interspersed with variable regions. This 
segmental homology is conceptually similar to the 
organization of framework and hypervariable domains within 

25 the families of immunoglobulin and T cell receptor variable 
region sequences (27, 28). This analogy goes beyond 
structural organization and may extend to the function of 
these two gene families; each family consists of a large 
number of genes which have diversified over evolutionary 

30 time to accommodate the binding of a highly diverse array of 
ligands. The evolutionary mechanisms responsible for the 
diversification and maintenance of these large gene families 
may also be similar. It has been suggested that gene 
conversion has played a major role in the evolution of 

35 immunoglobulin and T cell receptor variable domains (29, 30 
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and 31) . Analysis of the sequence of the putative olfactory 
receptors reveals at least one instance where a motif from 
a variable region of one subfamily is found imbedded in the 
otherwise divergent sequence of a second subfamily, 
5 suggesting that conversion has occurred. Such a mixing of 
motifs from one subfamily to another over evolutionary time 
would provide additional combinatorial possibilities leading 
to the generation of diversity. 

It should be noted, however, that the combinatorial joining 
of gene segments by DNA rearrangement during development, 
which is characteristic of immunoglobulin loci (27) , is not 
a feature of the putative odor receptor gene family. No 
evidence for DNA rearrangement to generate the diversity of 
genes cloned has been observed. The entire coding region 
has been sequenced along with parts of the 5* and 3' 
untranslated regions of 10 different cDNA clones. The 
sequences of the coding regions are all different; no 
evidence has been obtained for constant regions that would 
suggest DNA rearrangement of the sort seen in the immune 
system. The observations indicate that the diversity 
olfactory proteins are coded by a large number of distinct 
gene sequences. 

25 Although it is unlikely from the data that DNA rearrangement 
is responsible for the generation of diversity among the 
putative odorant receptors, it remains possible that DNA 
rearrangements may be involved in the regulation of 
expression of this gene family. If each olfactory neuron 

30 expresses only one or a small number of genes, then a 
transcriptional control mechanism must be operative to 
choose which of the more than one hundred genes within the 
family will be expressed in a given neuron. Gene conversion 
from one of multiple silent loci into a single active locus, 

35 as observed for the trypanosome-variable surface 
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glycoproteins (32), provides one attractive model. The gene 
conversion event could be stochastic, such that a given 
neuron could randomly express any one of several hundred 
receptor genes, or regulated (perhaps by positional 
5 information) , such that a given neuron could only express 
one or a small number of predetermined receptor types. 
Alternatively, it is possible that positional information in 
the olfactory epithelium controls the expression of the 
family of olfactory receptors by more classical mechanisms 

10 that do not involve DNA rearrangement. What ever mechanisms 
will regulate the expression of receptor genes within this 
large, multigene family, these mechanisms must accommodate 
the requirement that olfactory neurons are regenerated every 
30-60 days (8) and therefore the expression of the entire 

15 repertoire of receptors must be accomplished many times 
during the life of an organism. 

Receptor Diversity and the Central Processing of Olfactory 
Information 

20 

The results suggest the existence of a large family of 
distinct odorant receptors. Individual members of this 
receptor family are likely to be expressed by only a small 
set of the total number of olfactory neurons. The primary 

25 sensory neurons within the olfactory epithelium will 
therefore exhibit significant diversity at the level of 
receptor expression. The question then emerges as to 
whether neurons expressing the same receptors are localized 
in the olfactory epithelium. Does the olfactory system 

30 employ a topographic map to discriminate among the numerous 
odorant? The spatial organization of distinct classes of 
olfactory sensory neurons, as defined by receptor 
expression, can now be determined by using the procedures of 
in situ hybridization and immunohistochemistry with probes 

35 specific for the individual receptor subtypes. This 
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information should help to distinguish between different 
models that have been proposed to explain the coding of 
diverse odorant stimuli (33). 

■ 

5 In one model, sensory neurons that express a given receptor 
and respond to a given odorant may be localized within 
defined positions within the olfactory epithelium. This 
topographic arrangement would also be reflected in the 
projection of olfactory sensory axons into discrete regions 

10 (glomeruli) within the olfactory bulb. In this scheme, the 
central coding to permit the discrimination of discrete 
odorant would depend, in part, on the spatial segregation of 
different receptor populations. Attempts to discern the 
topographic localization of specific receptors at the level 

15 of the olfactory epithelium has led to conflicting results. 
In some studies, electrophysiological recordings have 
revealed differences in olfactory responses to distinct 
odorant in different regions of the olfactory epithelium 
(34, 35). However, these experiments have been difficult to 

20 interpret since the differences in response across the 
epithelium are often small and are not observed in all 
studies (36) . 

A second model argues that sensory neurons expressing 
25 distinct odorant receptors are randomly distributed in the 
epithelium but that neurons responsive to a given odorant 
project to restricted regions within the olfactory bulb. In 
this instance, the discrimination of odors would be a 
consequence of the position of second order neurons in the 
30 olfactory bulb, but would be independent of the site of 
origin of the afferent signals within the epithelium. 
Mapping of the topographic projections of olfactory neurons 
has been performed by extracellular recordings from 
different regions of the bulb (37, 38) and by 2-deoxyglucose 
35 autoradiography to map regional activity after exposure to 
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different odorant (39) . These studies suggest that 
spatially-localized groups of bulbar neurons preferentially 
respond to different odorant. The existence of specific 
odorant receptors, randomly distributed through the 
5 olfactory epithelium, which converge on a common target 
within the olfactory bulb, would raise additional questions 
about the recognition mechanisms used to guide these 
distinct axonal subsets to their central targets. 

10 Other sensory systems also spatially segregate afferent 
input from primary sensory neurons. The spatial segregation 
of information employed, for example, by the visual and 
somatosensory systems, is used to define the location of the 
stimulus within the external environment as well as to 

15 indicate the quality of the stimulus. In contrast, 
olfactory processing does not extract spatial features of 
the odorant stimulus. Relieved of the necessity to encode 
information about the spatial localization of the sensory 
stimulus, it is possible that the olfactory system of 

20 mammals uses the spatial segregation of sensory input solely 
to encode the identity of the stimulus itself. The 
molecular identification of the genes likely to encode a 
large family of olfactory receptors should provide initial 
insights into the underlying logic of olfactory processing 

25 in the mammalian nervous system. 
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CTTTGTGGAC 


ATCTGTTTCA 


TTTCAACCAC 


TGTTCCAAAG 


240 


ATGTTAGTGA 


ACATCCAGAC 


CCAAAACAAT 


GTCATCACCT 


ATGCAGGATG 


CATTACCCAG 


300 


ATATACTTTT 


TCTTGCTCTT 


TGTAGAATTG 


GACAACTTCT 


TGCTGACTAT 


CATGGCCTAT 


360 


GACCGTTACG 


TAGCCATCTC 


TCACCCCATC 


CACTACACAG 


TTATCATCAA 


CTACAAGCTC 


420 


TGTGGATTTC 


TGGTTCTGGT 


ATCTTGGATT 


GTAAGTGTTC 


TCCATGCCTT 


GTTTCAAAGC 


480 


TTGATGATGT 


TGGCGCTGCC 


CTTCTGCACA 


CATCTGGAAA 


TCCCACACTA 


CTTCTGTGAA 


540 


CCTAATCAGG 


TGATTCAACT 


CACCTGTTCT 


GATGCATTTC 


TTAATGATCT 


TGTGATATAT 


600 


TTTACACTTG 


TGCTGCTGGC 


TACTGTTCCT 


CTTGCTGGCA 


TCTTCTATTC 


TTACTTCAAG 


660 


ATAGTGTCCT 


CCATATGTGC 


TATATCGTCA 


GTTCATGGGA 


AGTACAAAGC 


ATTCTCCACC 


720 


TGTGCATCTC 


ACCTTTCAGT 


CGTGTCTTTA 


TTTTACTGCA 


CAGGACTAGG 


AGTGTACCTC 


780 


AGTTCTGCTG 


CAAACAACAG 


CTCACAGGCA 


AGTCCCACAG 


CCTCAGTCAT 


GTACACTGTA 


840 


GTTACCCCTA 


TGGTGAACCC 


TTTTATCTAT 


AGTCTTAGGA 


ATAAAGATGT 


TAAGAGTGTT 


900 


CTGAAAAAAA 


CTCTTTGTGA 


GGAAGTTATA 


AGGAGTCCAC 


CTTCCCTACT 


TCATTTCTTC 


960 


CTAGTGTTAT 


GTCATCTCCC 


TTGTTTTATT 


TTTTGTTATT 


AA 




1002 



(2) INFORMATION FOR SEQ ID NO: 3: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: F5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATGAGCAGCA CCAACCAGTC CAGTGTCACC GAGTTCCTCC TCCTGGGACT CTCCAGGCAG 60 
CCCCAGCAGC AGCAGCTCCT CTTCCTGCTC TTCCTCATCA TGTACCTGGC CACTGTCCTG 120 
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GGAAACCTGC TCATCATCCT GGCTATTGCC ACAGACTCCC GCCTGCACAC CCCCATGTAC 180 

TTCTTCCTCA GTAACCTGTC CTTTGTGGAT GTCTGCTTCT CCTCTACCAC TGTCCCTAAA 240 

GTTCTGGCCA ACCATATACT TGGGAGTCAG GCCATTTCCT TCTCTGGGTG TCTCACCCAG 300 

CTGTATTTTC TCGCTCTCTT TGGTAACATG GACAATTTCC TGCTGGCTGT GATGTCCTAT 360 

GACCGATTTG TGGCCATATG CCACCCTTTA CACTACACAA CAAAGATGAC CCGTCAGCTC 420 

TGTGTCCTGC TTGTTGTGGG GTCATGGGTT GTAGCCAACA TGAATTGTCT GTTGCACATA 480 

CTGCTCATGG CTCGACTCTC CTTCTGTGCA GACAACATGA TCCCCCACTT CTTCTGTGAT 540 

GGAACTCCCC TCCTGAAACT CTCCTGCTCA GACACACATC TCAATGAGCT GATGATTCTT 600 

ACAGAGGGAG CTGTGGTCAT GGTCACCCCA TTTGTCTGCA TCCTCATCTC CTACATCCAC 660 

ATCACCTGTG CTGTCCTCAG AGTCTCATCC CCCAGGGGAG GATGGAAATC CTTCTCCACC 720 

TGTGGCTCCC ACCTGGCTGT GGTCTGCCTC TTCTATGGCA CCGTCATCGC TGTGTATTTC 780 

» 

AACCCATCAT CCTCTCACTT AGCTGGGAGG GACATGGCAG CTGCAGTGAT GTATGCAGTG 840 
GTGACCCCAA TGCTGAACCC TTTCATCTAT AGCCTGAGGA AC AG CG AC AT GAAAGCAGCT 900 
TTAAGGAAAG TGCTCGCCAT GAGATTTCCA TCTAAGCAGT AA 942 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 
<B) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 

(B) CLONE : F6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATGGCTTGGA GTACTGGCCA GAACCTGTCC ACACCAGGAC CATTCATCTT GCTGGGCTTC 60 

CCAGGGCCAA GGAGCATGCG CATTGGGCTC TTCCTGCTTT TCCTGGTCAT GTATCTGCTT 120 

ACGGTAGTTG GAAACCTAGC CATCATCTCC CTGGTAGGTG CCCACAGATG CCTACAGACA 180 

CCCATGTACT TCTTCCTCTG CAACCTCTCC TTCCTGGAGA TCTGGTTCAC CACAGCCTGC 240 

GTACCCAAGA CCCTGGCCAC ATTTGCGCCT CGGGGTGGAG TCATTTCCTT GGCTGGCTGT 300 
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GCCACACAGA TGTACTTTCT CTTTTCTTTG GGCTGTACCC AGTACTTCCT GCTGGCTGTG ■ 360 

ATGGCTTATG ACCCCTACCT GCCCATCTGC CTGCCACTGC GCTATGCTGG CATC ATG ACT 420 

CCTGCGCTGC CCATCCGCTT GGCCCTGGGA TCCTCCCTGT CTGGCTTTTC TCCAATCACA 480 

CTTCCTCCTA CCCTCATTCC CCGCCTCTCT TTCTCTGGCT CACGTCTCAT CAACCACTTC 540 

TTCTGTGACA TTTCGCCCTG GATAGTGCTT TCCTGCACCG ACACGCAGGT GGTGGAACTG 600 

GTCTCCTTTG GCATTGCCTT CTGTGTTATT CTGGGCTCGT GTGGTATCAC ACTACTCTCC 660 

TATGCTTACA TCATCACTAC CATCATCAAG ATTCCCTCTG CCCGGGGCCG GCACCGCGCC 720 

TTCTCAACCT GCTCATCCCA TCTCACTGTG CTGCTGATTT GGTATGCCTC CACCATCTTC 780 

TTGCATGTGA GGACCTCGCT AGAGAGCTCC TTGGACCTCA CCAAAGCTAT CACAGTGCTC 840 

AACACCATTG TCACACCTGT CCTGAACCCT TTCATATATA CTCTGAGGAA CAAGGATGTC 900 

AAGGAAGCTC TGCGCAGGAC GGTGAAGGGC AAGTGA 936 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 baee pairs 

(B) TYPE i nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ili) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 114 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5; 

ATGACTGGAA ATAACCAAAC TTTGATCTTG GAGTTCCTCC TCCTGGGTCT GCCCATCCCA 60 

TCAGAGTATC ATCTCCTCTT CTATGCCCTG TTCCTGCCCA TCTACCTCAC CATCATCCTG 120 

GGAAACCTGC TAATCATTGT CCTTGTTCGA CTGGACTCTC ATCTCCACAT GCCCATGTAC 180 

TTGTTTCTCA GCAACTTGTC CTTCTCTGAC CTCTGCTTTT CCTCTGTCAC AATGCCCAAA 240 

TTGCTTCAGA ACATGCAGAG CCAAGTACCA TCTATATCCT ATACAGGCTC CCTGACACAG 300 

CTGTACTTCT TTATGGTTTT TGGAGATATG CAGAGCTTCC TTCTTGTGGT CATGGCCTAT 360 

GACCGCTATG TGGCCATTTG CTTTCCTTTG CGTTACACCA CCATCATGAG CACCAAGTTC 420 

TGTGCTTCAC TAGTGCTACT TCTGTGGATG CTGACGATGA CCCATGCCCT GCTGCATACC 480 
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CTACTCATTG CTAGATTGTC TTTTTGTGAG AACAATGTGA TTCTTCACTT TTTCTGTGAC 540 

ATTTCTGCTC XTCTGAAGTT GTCCTGCTCA GACATTTATG TTAATGAGCT GATGATATAT 600 

ATCTTGGGTG GACTCATCAT TATTATCCCA TTCCTATTAA TTGTTATGTC CTATGTTAGA 660 

ATTTTCTTCT CCATTTTGAA GTTTCCATCT ATTCAGGACA TCTACAAGGT ATTCTCAACC 720 

TGTGGTTCCC ATCTGTCTGT GGTGACCTTG TTTTATGGGA CAATTTTTGG TATCTACTTA 780 

TGTCCATCAG GTAATAATTC TACTGTGAAG GAGATTGCCA TGGCTATGAT GTACACACTG 840 

GTGACTCCCA TGCTGAATCC CTTCATCTAC AGCCTGAGGA ACAGAGACAT GAAAAGGGCC 900 

CTAATAAGAG TTATCTGCAC TAAGAAAATC TCTCTGTAA 939 
(2) INFORMATION FOR SEQ ID NO:6: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 
<B> STRAIN: Srpague-Dawley rat 

<F) TISSUE TYPE: olfactory epithelium 

<vii) IMMEDIATE SOURCE: 

(B) CLONE: 115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATGACAGAAG AGAACCAAAC TGTGATCTCC CAGTTCCTTC TCCTTTTCCT GCCCATCCCC 60 

TCAGAGCACC AGCACGTGTT CTACGCCCTG TTCCTGTCCA TGTACCTCAC CACTGTCCTG 120 

GGGAACCTCA TCATCATCAT CCTCATTCAC CTGGACTCCC ATCTCCACAC ACCCATGTAC 180 

TTGTTTCTCA GCAACTTGTC CTTCTCTGAT CTCTGCTTTT CCTCTGTTAC GATGCCCAAG 240 

TTGTTGCAGA ACATGCAGAC CCAAGTTCCA TCCATCCCCT TTGCAGGCTG CCTGACACAA 300 

TTATACTTTT ACCTGTATTT TGCAGACCTT GAGAGCTTCC TCCTTGTGGC CATGGCCTAT 360 

GACCGCTATG TGGCCATCTG CTTCCCCCTT CATTACATGA GCATCATGAG CCCCAAGCTC 420 

TGTGTGAGTC TGGTGGTGCT GTCCTGGGTG CTGACCACCT TCCATGCCAT GCTGCACACC 480 

CTGCTCATGG CCAGATTGTC ATTCTGTGCG GACAATATGA TCCCCCACTT TTTCTGTGAT 540 

ATATCTCCTT TATTGAAACT GTCCTGCTCT GACACGCATG TTAATGAGTT GGTGATATTT 600 
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GTCATGGGAG GGCTTGTTAT TGTCATTCCA TTTCTGCTCA TCATTGTATC TTATGCACGA 660 

GTTGTCGCCT CCATTCTTAA AGTCCCTTCT GTCCGAGGCA TCCACAAGAT CTTCTCCACC 720 

TGCGGCTCCC ATCTGTCTGT GGTGTCACTG TTCTATGGCA CAATCATTGC TCTCTACTTA 780 

TGTCCGTCAG CTAATAACTC TACTGTGAAG GAGACTGTCA TCGCCATGAT GTACACAGTG 640 

GTGACCCCCA TGCTGAACCC CTTCATCTAC AGCCTGAGGA ACAGAGACAT GAAAGAGGCA 900 

CTGATAAGAG TCCTTTGTAA AAAGAAAATT ACCTTCTGTC TATGA 945 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 base pairs 
(8) TYPE 1 nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U> MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

<vi) ORIGINAL SOURCE; 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(?) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ATGAACAATC 


AAACTTTCAT 


CACCCAATTC 


CTTCTCCTGG 


GACTGCCCAT 


CCCTGAAGAA 


60 


CATCAGCACC 


TGTTCTATGC 


CTTGTTCCTG 


CTCATGTACC 


TCACCACCAT 


CTTGGGAAAC 


120 


TTGCTAATCA 


TTGTACTTGT 


TCAACTGGAC 


TCCCAGCTCC 


ACACACCTAT 


GTATTTGTTT 


180 


CTCAGCAATT 


TGTCTTTCTC 


TGATCTATCT 


TTTTCCTCTG 


TCACAATGCC 


CAAGCTGCTG 


240 


CAGAACATGA 


GGAGCCAGGA 


CACATCCATT 


CCCTATGGAG 


GCTGCCTGGC 


ACAAACATAC 


300 


TTCTTTATGG 


TTTTTGGAGA 


TATGGACAGT 


TTCCTTCTTG 


TGGCCATGGC 


CTATGACCGC 


360 


TATGTGGCCA 


TCTGCTTCCC 


TCTGCATTAC 


ACCAGCATCA 


TGAGCCCCAA 


CCTCTGTACT 


420 


TGTCTAGTGC 


TGTTATTGTG 


GATGCTGACG 


ACATCCCATG 


CCATGATGCA 


CACACTGCTT 


480 


GCAGCAAGAT 


TGTCTTTTTG 


TGAGAACAAT 


GTGGTCCTCA 


ACTTCTTCTG 


TGACCTATTT 


540 


GTTCTCCTAA 


AGCTGGCCTG 


CTCAGACACT 


TATATTAATG 


AGTTGATGAT 


ATTTATCATG 


600 


AGTACACTCC 


TCATTATTAT 


TCCATTCTTC 


CTCATTCTTA 


TGTCCTATGC 


AAGGATCATA 


660 


TCCTCTATTC 


TTAAGGTTCC 


ATCTACCCAA 


GGCATCTGCA 


AGGTCTTCTC 


TACCTGTGGT 


720 
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TCCCATCTGT CTGTACTATC ACTGTTCTAT GCGACAATTA TTGGTCTCTA CTTATGTCCA 



780 



CCAGGTAATA ATTCCACTCT AAAAGAGATG GTCATGGCCA TGATGTACAC TGTGGTGACC 



640 



CCCATGCTGA ATCCCTTCAT CTACAGCCTA AGGAATAGAG ATATGAAGAG GGCCCTAATA 



900 



AGAGTTATCT GTAGTATGAA AATCACTCTG TAA 



933 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH i 984 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<li) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 17 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 8: 



ATGGAGCGAA 


GGAACCACAG 


TGGGAGAGTG 


AGTGAATTTG 


TGTTGCTGGG 


TTTCCCAGCT 


60 


CCTGCCCCAC 


TGCGAGTACT 


ACTATTTTTC 


CTTTCTCTTC 


TGGACTATGT 


GTTGGTGTTG 


120 


ACTGAAAACA 


TGCTCATCAT 


TATAGCAATT 


AGGAACCACC 


CAACCCTCCA 


CAAACCCATG 


180 


TATTTTTTCT 


TGGCTAATAT 


GTCATTTCTG 


CAGATTTGGT 


ATGTCACTGT 


TACGATTCCT 


240 


AAGATGCTCG 


CTGGCTTCAT 


TGGTTCCAAG 


GAGAACCATG 


GACAGCTGAT 


CTCCTTTGAG 


300 


GCATGCATGA 


CACAACTCTA 


CTTTTTCCTG 


GGCTTGGGTT 


GCACAGAGTG 


TGTCCTTCTT 


360 


GCTGTGATGG 


CCTATGACCG 


CTATGTGGCT 


ATCTGTCATC 


CACTCCACTA 


CCCCGTCATT 


420 


GTCAGTAGCC 


GCCTATGTGT 


GCAGATGGCA 


GCTGGATCCT 


GGGCTGGAGG 


TTTTGGTATC 


480 


TCCATGGTTA 


AAGTTTTCCT 


TATTTCTCGC 


CTGTCTTACT 


GTGGCCCCAA 


CACCATCAAC 


540 


CACTTTTTCT 


GTGATGTGTC 


TCCATTGCTC 


AACCTGTCAT 


GCACTGACAT 


GTCCACAGCA 


600 


GAGCTTACAG 


ACTTTCTCCT 


GGCCATTTTT 


AT TCTGCTGG 


GACCGCTCTC 


TGTCACTGGC 


660 


GCATCCTACA 


TGCCCATCAC 


AGGTGCTGTG 


ATGCGCATCC 


CCTCAGCTGC 


TGGCCGCCAT 


720 


AAAGCCTTTT 


CAACCTGTGC 


CTCCCACCTC 


ACTGTTCTGA 


TCATCTTCTA 


TGCAGCCAGT 


780 


ATTTTCATCT 


ATGCCAGGCC 


TAAGGCACTC 


TCAGCTTTTG 


ACACCAACAA 


GCTGGTCTCT 


840 


GTACTCTACG 


CTGTCATTGT 


ACCGTTGTTC 


AATCCCATCA 


TCTACTGCTT 


GCGCAACCAA 


900 
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GATCTCAAAA GAGCCCTACC TOGCACGCTG CACCTGGCCC AGCACCAGCA GCCCAATACC 



AACAAAGCCA GCAAAATTGG TTAG 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 939 base pairs 
(8) TYPEs nucleic acid 
(C) STRANDSDNZSSi single 
(0) TOPOLOGY i linear 

(ii> MOLECULE TYPE: cDNA 



984 



<U1> HYPOTHETICAL: YES 
<iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(P) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATGAACAACA AAACTCTCAT CACCCATTTC CTCCTCCTGG GATTGCCCAT CCCCCCAGAG 60 

CACCAGCAAC TGTTCTTTGC CCTGTTCCTG ATCATGTACC TCACCACCTT TCTGOGAAAC 120 

CTGCTAATTG TTGTCCTTGT TCAACTGCAC TCTCATCTCC ACACACCCAT GTACTTGTTT 180 

CTCAGCAACT TGTCCTTCTC TCATCTCTCC TTTTCCTCTC TTACAATGCT CAAATTGCTC 240 

CAAAATATAC AGAGCCAAGT ACCATCTATA TCCTATGCAG GATGCCTGAC ACAGATATTC 300 

TTCTTTTTGT TGTTTGGCTA CCTTGGGAAT TTCCTTCTTG TAGCCATGGC CTATGACCGC 360 

TATGTGGCCA TCTGCTTCCC TCTGCATTAT ACCAACATCA TGAGCCATAA GCTCTGTACT 420 

TGTCTCCTGC TGGTATTTTG GATAATGACA TCATCTCATG CCATGATCCA CACCCTGCTT 480 

GCAGCAAGAT TGTCTTTTTG TGAGAACAAT GTACTCCTCA ACTTTTTCTG TGACCTGTTT 540 

GTTCTCCTAA AGTTGGCCTG CTCAGACACT TATGTTAATG AGTTGATGAT ACATATCATG 600 

GCCGTGATCA TCATTGTTAT TCCATTCGTG CTCATTGTTA TATCCTATGC CAAGATCATC 660 

TCCTCCATTC TTAAGGTTCC ATCTACTCAA AGCATTCACA AGGTCTTCTC CACTTGTGGT 720 

TCTCATCTCT CTGTGGTGTC TCTGTTCTAC GGGACAATTA TTGGTCTCTA TTTATGTCCA 780 

TCAGCTGATA ATTTTAGTCT AAAGGCGTCT GCCATGCCTA TGATGTACAC AGTGGTAACT 840 

CCAATGCTGA ACCCGTTCAT CTACACCCTA AG AAA C AG AG ACATCAAGCA CCCCCTAATA 900 

AGAGTTACCT GTAGCAAGAA AATCTCTCTG CCATGGTAC 939 
(2) INFORMATION FOR SEQ ID NO: 10: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 945 base pairs 
(BJ TYPE i nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCEi 

(A) ORGAN ISM i rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE : 
(B) CLONE: 19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

ATGACTAGAA GAAACCAAAC TGCCATCTCT CAGTTCTTCC TTCTGGGCCT GCCATTCCCC 60 

CCAGAGTACC AACACCTGTT CTATGCCCTG TTCCTGGCCA TGTACCTCAC CACTCTCCTG 120 

GGGAACCTCA TCATCATCAT CCTCATTCTA CTGGACTCCC ATCTCCACAC ACCCATGTAC 180 

TTGTTTCTCA GCAATTTATC CTTTGCOGAC CTCTGTTTTT CCTCTGTCAC AATGCCCAAG 240 

TTCTTGCAGA ACATGCAGAG CCAAGTTCCA TCCATCCCCT ATGCAGGGTG CCTGCCACAG 300 

ATATACTTCT TTCTGTTTTT TGGAGACCTT GGAAACTTCC TGCTTGTGGC CATGGCCTAT 360 

GACCGCTATG TGGCCATCTG CTTCCCCCTT CATTACATGA GCATCATGAG CCCCAAGCTC 420 

TGTGTGAGTC TGGTGGTGCT GTCCTGGGTG CTGACTACCT TCCATGCCAT GCTGCACACC 480 

CTGCTCATGG CCAGATTGTC ATTCTGTGAG GACAGTGTGA TCCCTCACTA TTTCTCTGAT 540 

ATGTCTACTC TGCTGAAAGT GGCTTGTTCT GACACCCATG ATAATGAATT AGCAATATTT 600 

ATCTTAGGGG GCCCTATAGT TGTACTACCT TTCCTTCTCA TCATTGTTTC TTATGCAAGA 660 

ATTGTTTCCT CCATCTTCAA GGTCCCTTCT TCTCAAAGCA TCCATAAAGC CTTCTCCACC 720 

TGTCGCTCCC ACCTGTCTGT GGTGTCACTC TTCTATGGGA CAGTCATTGG TCTCTACTTA 780 

TGTCCTTCAG CTAATAACTC CACTGTGAAG GAGACTGTCA TCTCTTTGAT GTACACAATG 840 

GTGACACCCA TGCTGAACCC CTTCATCTAC AGCCTAAGAA AC AG AG AC AT AAAAGATGCA 900 

TTAGAAAAAA TAATGTCCAA AAAGCAAATT CCCTCCTTTC TATGA 945 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 645 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(it) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL » YES 
<iv) ANT I -SENSE i NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM i homos aplen 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: H5 

(ix) FEATURE i 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..645 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
ATC TGT TTT GTG TCT ACC ACT GTC CCA AAC CAG CTG GTC AAC ATC CAG 48 

lie Cya Phe Val Ser Thr Thr Val Pro Lya Gin Leu Val Asn He Gin 
15 10 15 

ACA CAG AGC AC A GTC ATC ACC TAT GCA GAC TGC ATC ACC CAG ATG TGC 96 
Thr Gin Ser Arg Val He Thr Tyr Ala Asp Cya He Thr Gin Met Cya 

20 25 30 

TTT TTT ATA CTC TTT GTA CTG TTG GAC AGC TTA CTC CTG ACT GTG ATG 144 
Phe Phe He Leu Phe Val Val Leu Asp Ser Leu Leu Leu Thr Val Met 
35 40 45 

GCC TAT GAC CGG TTT GTG GCC ATC TCT CAC CCC CTG CAC TAC ACA CTC 192 
Ala Tyr Asp Arg Phe Val Ala He Cya Hie Pro Leu His Tyr Thr Val 
50 55 60 

ATT ATG AGC TCC TGC CTC TGT GGA CTG CTG GTT CTG GTG TCC TGC ATC 240 
lie Met Ser Ser Trp Leu Cya Gly Leu Leu Val Leu Val Ser Trp He 
65 70 75 80 

GTG AGC ATC CTA TAT TCT CTG TTA CAA AGC ATA ATG GCA TTG CAG CTG 268 
Val Ser He Leu Tyr Ser Leu Leu Gin Ser He Het Ala Leu Gin Leu 

85 90 95 

TCC TTC TGT ACA GAA CTG AAA ATC CCT CAA TTT TTC TGT GAA CTT AAT 336 
Ser Phe Cye Thr Glu Leu Lya He Pro Gin Phe Phe Cys Glu Leu Asn 

100 105 110 

CAG GTC ATC CAC CTT GCC TGT TCC CAC ACT TTT ATT AAT GAC ATG ATG 384 
Gin Val He Hie Leu Ala Cye Ser Asp Thr Phe He Aan Asp Met Met 
115 120 125 

ATG AAT TTT ACA AGT GTG CTC CTG GOT GGG GGA TGC CTC GCT GGA ATA 432 
Met Aen phe Thr Ser Val Leu Leu Gly Gly Gly Cye Leu Ala Gly He 
130 135 140 

TTT TAC TNN TAC TTT AAG ATA CTT TGT TGC ATA TGT TCG ATC TCA TCA 480 
Phe Tyr Xaa Tyr Phe Lya He Leu Cye Cya He Cye Ser He Ser Ser 
145 150 155 160 

GCT CAG GGG ATG AAT AAA GCA CTT TCC ACC TGT GCA TCT CAC CTC TCA 528 

Ala Gin Gly Met Asn Lye Ala Leu Ser Thr Cys Ala Ser Hie Leu Ser 

165 170 175 
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GTT GTC TCC TTA TTT TAT TGT ACA GGC GTA GGT GTG TAC CTT ACT TCT 576 
Val Val Ser Leu Phe Tyr Cys Thr Gly Val Cly Val Tyr Leu Ser Ser 

180 185 190 

GCT GCA ACC CAT AAC TCA CTC TCA AAT GCT GCA GCC TCG GTG ATG TAC 624 

Ala Ala Thr His Asn Ser Leu Ser Asn Ala Ala Ala Ser Val Met Tyr 

195 200 205 

ACT GTG GTC ACC TCC ATG CTC 645 
Thr Val Val Thr Ser Met Leu 
210 215 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 215 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

lie Cye Phe Val Ser Thr Thr Val Pro Lys Gin Leu Val Aon lie Gin 
15 10 15 

Thr Gin Ser Arg Val He Thr Tyr Ala Asp Cya He Thr Gin Met Cys 

20 25 30 

Phe Phe He Leu Phe Val Val Leu Asp Ser Leu Leu Leu Thr Val Met 
35 40 45 

Ala Tyr Asp Arg Phe Val Ala He Cye His Pro Leu His Tyr Thr Val 
50 55 60 

He Met Ser Ser Trp Leu Cyo Cly Leu Leu Val Leu Val Ser Trp He 
65 70 75 80 

Val Ser He Leu Tyr Ser Leu Leu Gin Ser He Met Ala Leu Gin Leu 

85 90 95 

Ser Phe Cys Thr Glu Leu Lya He Pro Gin Phe Phe Cys Glu Leu Am 

100 105 110 

Gin Val He Hie Leu Ala Cys Ser Asp Thr Phe He Asn Asp Met Met 
115 120 125 

Met Aen Phe Thr Ser Val Leu Leu Gly Gly Gly Cya Leu Ala Gly lie 
130 135 140 

Phe Tyr Xaa Tyr Phe Lye lie Leu Cye Cys He Cya Ser He Ser Ser 
145 150 155 160 

Ala Gin Gly Met ABn Lys Ala Leu Ser Thr Cys Ala Ser His Leu Ser 

165 170 175 

Val Val Ser Leu Phe Tyr Cye Thr Gly Val Gly Val Tyr Leu Ser Ser 

180 185 190 

Ala Ala Thr His Asn Ser Leu Ser Asn Ala Ala Ala Ser Val Met Tyr 
195 200 205 
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Thr Val Val Thr Ser Met Leu 
210 215 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH i 640 base pairs 
( B} TYPE: nucleic acid 

(C) STRAND ED NESS i single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
<iii) HYPOTHETICAL t YES 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE : 
(8) CLONE: Jl 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 2.. 640 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13; 

C ATC TGC TTT ACT TCT GCT AGC ATC CCA AAG ATG CTA CTG AAT ATA 46 
lie Cye Phe Thr Ser Ala Ser lie Pro Lye Met Leu Val Aen He 
15 10 15 

CAC ACG AAG AAC AAG GTG ATC ACC TAT GAA CGC TCC ATC TCC CAA GTA 94 
Gin Thr Lye Asn Lys Val He Thr Tyr Glu Gly Cye He Ser Gin Val 

20 25 30 

TAC TTT TCA TAC TCT TTG GAG TTT TGG ACA ACT TTC TTC TOG ACT GTG 142 
Tyr Phe Ser Tyr Ser Leu Glu Phe Trp Thr Thr Phe Phe Ser Thr Val 

35 40 45 

ATG GCC TAT GAC CGA TAT GTG GCC ATC TGT CAC CCA TCT NAC TAC ACA 190 
Met Ala Tyr Asp Arg Tyr Val Ala He Cye His Pro Ser Xaa Tyr Thr 
50 55 60 

GGT CAT CAT GAA CCN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 238 
Gly His His Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 286 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
80 65 90 95 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 334 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 382 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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115 120 125 

NNN NNS NNN NNN NNN NNN NNN NNN NNN KNN NNN NNN NNN NNN NNN NTT 430 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
130 135 140 

TAT TCT TAC TCT AAG ATA GTT TCC TCC ATA CCA GAA ATC TCA TCA TCA 478 
Tyr Ser Tyr Ser Ly» lie Val Ser Ser 11* Arg Clu He Ser Ser Ser 
145 150 155 

CAG GGA AAG TAC AAG NNA TTC TCC ACC TGT GCA TCC CAC CTC TCA GTT 526 
Gin Gly Lys Tyr Lye Xaa Phe Ser Thr Cys A1& Ser His Leu Ser Val 
160 165 170 175 

GTT TCA TTA TTC TAT TCT ACA CTT TTC GGT GTG TAC CTT AGT TCT TCT 574 

Val ser Leu Phe Tyr Ser Thr Leu Leu Gly Val Tyr Leu Ser Ser Ser 

180 165 190 

TTT ACC CAA AAC TCA CAC TCA ACT GCA CGG GCA TCT GTT ATG TAC AGT 622 
Phe Thr Gin Aen Ser His ser Thr Ala Arg Ala Ser Val Met Tyr Ser 

195 200 205 

GTG GTC ACC CCC ATG TTG 640 
Val Val Thr Pro Met Leu 
210 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acido 
(S) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Li) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

He Cys Phe Thr Ser Ala Ser He Pro Lye Met Leu Val Aen He Gin 
15 10 15 

Thr Lys Asn Lye Val He Thr Tyr Glu Gly Cys He Ser Gin Val Tyr 

20 25 30 

Phe Ser Tyr Ser Leu Glu Phe Trp Thr Thr Phe Phe Ser Thr Val Met 
35 40 45 

Ala Tyr Asp Arg Tyr Val Ala He Cys His Pro Ser Xaa Tyr Thr Gly 
50 55 60 

His His Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa xaa Xaa Xaa Xaa 

85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

115 120 125 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr 

130 135 140 

Ser Tyr Ser Lya lie Val Ser Ser lie Arg Glu He Ser Ser Ser Gin 
145 150 155 160 

Gly Lye Tyr Lya Xaa Phe Ser Thr Cye Ala Ser Hie Leu Ser Val Val 

165 170 175 

Ser Leu Phe Tyr Ser Thr Leu Leu Gly Val Tyr Leu Ser Ser Ser Phe 

180 185 190 

Thr Gin Ain Ser Hie Ser Thr Ala Arg Ala Ser Val Met Tyr Ser Val 

195 200 205 

Val Thr Pro Met Leu 
210 

(2) INFORMATION FOR SEQ ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: srpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(8) CLONE: J2 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..636 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ACC TCC ACC ACC ATC CCA AAG ATG CTG GTA AAT ATA CAC ACC CAG AGC 48 
Thr Ser Thr Thr He Pro Lye Met Leu Val Aan He Hie Thr Gin Ser 
1 5 10 15 

AAT ACT ATC ACC TAT GAA GAC TGT ATT TCC CAG ATG TTT GTA CTC TTG 96 
Aen Thr He Thr Tyr Glu Asp Cys He Ser Gin Met Phe Val Leu Leu 

20 25 30 

GTT TTT GGA GAA CTG CAC AAC TTT CTC CTG GCT GTG ATG GCC TAT GAT 144 
Val Phe Gly Glu Leu Asp Aan Phe Leu Leu Ala Val Met Ala Tyr Asp 
35 40 45 

CGA TAT GTG GCT ATC TGT CAC CCA CTG TAT TAG ACA GTC ATT GTG AAC 192 
Arg Tyr Val Ala He Cys His Pro Leu Tyr Tyr Thr Val He Val Aen 
50 55 60 
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CAC CGA CTC TGT ATC CTG CTG CTT CTG CTG TCC TOG GTT GTC AGC ATT 240 

His Arg Leu Cya Ila Leu Leu Leu Leu Leu Ser Trp Val Val Ser lie 

65 70 75 60 

TTA CAT GCC TTC TTA CAG AGC TTA ATT GTA CTA CAG TTG ACC TTC TGT 288 

Leu Hia Ala Phe Leu Gin Ser Leu lie Val Leu Gin Leu Thr Phe Cy» 

85 90 95 

GGA GAT GTG AAA ATC CCT CAC TTC TTC TGT GAG CTC AAT CAG CTG TCC 336 

Gly Asp Val Ly§ lie Pro Hia Phe Phe Cya Glu Leu Aan Gin Leu Ser 

100 105 110 

CAA CTC AC A TGT TCA GAC AAC TTT CCA AGT CAC CTC ACA ATG CAT CTT 384 

Gin Leu Thr Cyi Ser Asp Asn Phe Pro Ser Hie Leu Thr Met Hia Leu 

US 120 125 

GTA CCT GTT ATA TTT GCA GCT ATT TCC CTC AGT GGT ATC CTT TAC TCT 432 

Val Pro Val lie Phe Ala Ala lie Ser Leu Ser Gly lie Leu Tyr Ser 

130 135 140 

TAT TTC AAG ATA GTG TCT TCC ATA CGT TCT ATG TCC TCA GTT CAA GGG 480 

Tvr Phe Lya Ila Val Ser Ser lie Arg Ser Met Ser Ser Val Gin Gly 

145 150 155 160 

AAG TAC AAG GCA TTT TCT ACA TGT GCC TCT CAC CTT TCC ATT GTC TCC 528 

Lya Tyr Lye Ala Phe Ser Thr Cya Ala Ser Hie Leu Ser He Val Ser 

165 170 175 

TTA TTT TAT AGT ACA GGC CTC GGG GTG TAC GTC AGT TCT GCT GTG ATC 576 

Lau Phe Tyr ser Thr Gly Leu Gly Val Tyr Val ser Ser Ala Val lie 

180 165 190 

CGA ACC TCA CAC TCC TCT CCA AGT GCT TCG GTC ATG TAT ACT GTG GTC 624 

Arg Ser Ser His Ser Ser Ala Ser Ala Ser Val Met Tyr Thr Val Val 

195 200 205 

ACC CCC ATC TTG 636 
Thr Pro Met Leu 
210 

(2) INFORMATION FOR SEQ ID NO: 16 : 

\i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acida 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Thr Ser Thr Thr He Pro Lya Met Leu Val Aan He Hia Thr Gin Ser 

15 10 15 

Aan Thr He Thr Tyr Glu Asp Cya He Ser Gin Met Phe val Leu Leu 

20 25 30 

Val Phe Gly Glu Leu Aep Aan Phe Leu Leu Ala Val Met Ala Tyr Asp 

35 40 45 

Arg Tyr Val Ala He Cys Hie Pro Leu Tyr Tyr Thr Val He Val Aan 
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50 55 60 

Hia Arg Leu Cya lie Leu Leu Leu Leu Leu Ser Trp Val Val Ser lie 

65 70 75 80 

Leu Hifl Ala Phe Leu Cln Ser Leu lie Val Leu Gin Leu Thr Phe Cye 

85 90 95 

Gly Asp Val Lya lie Pro Hia Phe Phe Cy» Glu Leu Aen Gin Leu Ser 

100 105 no 

Gin Leu Thr Cya Ser Aep Ann Phe Pro Ser Hia Leu Thr Met Hia Leu 

115 120 125 

Val Pro Val lie Pn« Ala Ala lie Ser Leu Ser Gly He Leu Tyr Ser 

130 135 140 

Tyr Phe Lye lie Val Ser Ser lie Arg Ser Met Ser Ser Val Gin Gly 
145 150 155 160 

Lye Tyr Lye Ala Phe Ser Thr Cya Ala Ser Hia Leu Ser He Val Ser 

165 170 175 

Leu Phe Tyr Ser Thr Gly Leu Gly Val Tyr Val Ser Ser Ala Val He 

180 185 190 

Arg Ser Ser Hie Ser Ser Ala Ser Ala Ser Val Met Tyr Thr Val Val 
195 200 205 

Thr Pro Met Leu 
210 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: erpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: J4 

<ix) FEATURE: 

(A) NAME/KEY: CDS 
<B) LOCATION: 2 -.646 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
C ATA GGC TAT TCA TCT TCT GTC ACA CCC AAT ATC CTT GTC AAC TTC 46 
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Ile Gly Tyr Ser Sor Ser Val Thr Pro Aen Met Leu Val Aan Phe 

15 10 15 

CTT ATA AAC CAA AAT ACC ATC TCA TAC CTT GGA TGT TCT ATA CAG TTT 
Leu lie Lya Gin Aan Thr XI* Sor Tyr Leu Gly Cye Ser lie Gin Phe 

20 25 30 



GCC TCA GCT GCT TTG TTT GGA GGT CTT CAA TGC TTC CTT CTC GCT GCC 
Gly Sar Ala Ala Lou Phe Gly Gly Leu Glu Cye Phe Leu Leu Ala Ala 

35 40 45 



142 



ATG GCG TAT GAT CGT TTT GTA GCA ATC TGC AAC CCA CTG CTT TAT TCA 
Met Ala Tyr Asp Arg Phe Val Ala lie Cye Aen Pro Leu Leu Tyr Ser 
50 55 60 



190 



ACG AAA ATC TCC ACA CAA CTC TGT GTC CAG TTC CTT CTG GGA TCT TAT 

Thr Lya Met Ser Thr Gin Val Cye Val Gin Leu Val Val Gly Ser Tyr 

65 70 75 



236 



ATA GGG GGA TTT CTT AAT GCC TCC TCT TTT ACC CTT TCC TTT TTT TCC 
tie Gly Gly Phe Leu Aen Ala Ser Ser Phe Thr Leu Ser Phe Phe Ser 
80 85 90 95 



286 



TTG TCC TTC TGT GGA CCA AAT AGA ATC AAT CAC TTT TAC TGT GAT TTT 
Leu Ser Phe Cye Gly Pro Aan Arg He Aen Hie Phe Tyr Cye Asp Phe 

100 105 110 



334 



GCT CCG TTA GTA GAA CTT TCT TGC TCT GAT GTC AGT GTT CCT GAT GCT 

Ala Pro Leu Val Glu Leu Ser Cye Ser Asp Val Ser Val Pro Aep Ala 

115 120 125 



382 



GTT ACC TCA TTT TCT GCT GCC TCA GTT ACT ATG CTC ACA GTG TTT ATC 
Val Thr Ser Phe Ser Ala Ala Ser Val Thr Met Leu Thr Val Phe He 
130 135 140 



430 



ATA CCC ATC TCC TAT ACC TAT ATC CTC ATC ACC ATC CTG AAG ATG CGT 
He Ala He Ser Tyr Thr Tyr Xle Leu He Thr He Leu Lye Met Arg 

145 ISO 155 



478 



TCC ACT GAG GGT CGA CAG AAA GCA TTC TCT ACC TGC ACT TCC CAC CTC 
Ser Thr Glu Gly Arg Gin Lye Ala Phe Ser Thr Cye Thr Ser Hie Leu 

160 165 170 175 



526 



ACT GCA GTC ACT CTG TGC TAT GGA ACC ATC ACA TTC ATC TAT GTG ATG 

Thr Ala Val Thr Leu Cye Tyr Gly Thr He Thr Phe He Tyr Val Met 

180 185 190 



574 



CCC AAG TCC AGC TAC TCC ACA GAC CAG AAC AAG GTG GTG TCT GTG TTT 
Pro Lye Ser Ser Tyr Ser Thr Asp Gin Aen Lye Val Val Ser Val Phe 

195 200 205 



622 



TAT ATG GTG GTG ATC CCC ATG TTG 
Tyr Met val Val He Pro Met Leu 

210 215 



646 



(2) INFORMATION FOR SEQ ID NO:18t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acide 
(8) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

He Cly Tyr Ser Ser Ser Val Thr Pro Aan Met Leu Val Aan Phe Leu 
15 10 15 

He Lya Gin Aan Thr He Ser Tyr Leu Cly Cya Ser He Gin Phe Gly 

20 25 30 

Ser Ala Ala Leu Phe Gly Gly Leu Glu Cye Phe Leu Leu Ala Ala Met 
35 40 45 

Ala Tyr Aap Arg Phe Val Ala He Cye Aan Pro Leu Leu Tyr Ser Thr 
50 55 60 

Lye Met Ser Thr Gin Val Cya Val Gin Leu Val Val Gly Ser Tyr He 
65 70 75 80 

Gly Gly Phe Leu Aen Ala Ser Ser Phe Thr Leu Ser Phe Phe Ser Leu 

85 90 95' 

Ser Phe Cya Gly Pro Aan Arg He Aan Hia Phe Tyr Cya Aep Phe Ala 

100 105 110 

Pro Leu Val Glu Leu Ser Cya Ser Asp Val Ser Val Pro Aap Ala Val 
115 120 125 

Thr Ser Phe Ser Ala Ala Ser Val Thr Met Leu Thr Val Phe He He 
130 135 140 

Ala He Ser Tyr Thr Tyr He Leu He Thr He Leu Lya Met Arg Ser 
145 150 155 160 

Thr Glu Gly Arg Gin Lya Ala Phe Ser Thr Cya Thr Ser Hia Leu Thr 

165 170 175 

Ala Val Thr Leu Cye Tyr Gly Thr He Thr Phe He Tyr Val Met Pro 

180 185 190 

Lya Ser Ser Tyr Ser Thr Aap Gin Aan Lya Val Val Ser Val Phe Tyr 

195 200 205 

Met Val Val He Pro Met Leu 

210 215 

(2) INFORMATION FOR SEQ ID NO: 19: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 
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(B ) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE: o 1 factory epithelium 

(viij IMMEDIATE SOURCE: 
(B) CLONE* J7 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2., 481 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

C ATC TGC AAG CCC CTC CAC TAC ACC ACC ATC ATG AAT AAC CCA GTG 46 
lie Cyb Lya Pro Leu Hta Tyr Thr Thr lie Met Aen Aan Arg Val 
15 10 15 

TGC ACA GTT CTA GTC CTC TCC TGT TGG TTT GCT GGC CTG TTG ATC ATC 94 
Cye Thr Val Leu Val Leu Ser Cya Trp Phe Ala Gly Leu Leu lie lie 

20 25 30 

CTC CCA CCT CTT GGT CAT GGC CTC CAG CTG GAG TTC TGT GAC TCC AAT 142 
Leu Pro Pro Leu Gly His Gly Leu Gin Leu Glu Phe Cya Asp Ser Asn 

35 40 45 

GTG ATT GAT CAT TTT GGC TGT GAT GCC TCT CCA ATT CTG CAG ATA ACC 190 
Val He ABp Hia Phe Gly Cye Asp Ala Ser Pro He Lou Gin He Thr 
50 55 60 

TGC TCA GAC ACG GTA TTT ATA GAG AAA ATT GTC TTG GCT TTT GCC ATA 238 
Cya Ser Asp Thr Val Phe He Glu Lya He Val Leu Ala Phe Ala He 
65 70 75 

CTG ACA CTC ATC ATT ACT CTG. GTA TGT GTT GTT CTC TCC TAC ACA TAC 286 
Leu Thr Leu He He Thr Leu Val Cya Val Val Leu Ser Tyr Thr Tyr 
80 65 90 95 

ATC ATC AAG ACC ATT TTA AAG TTT CCT TCT GCT CAA CAA AGA AAA AAG 334 
lie He Lye Thr He Leu Lye Phe Pro Ser Ala Gin Gin Arg Lya Lya 

100 105 HO 

GCC TTT TCT ACA TGT TCT TCC CAC ATG ATT GTG GTT TCC ATC ACC TAT 382 
Ala Phe Ser Thr Cya Ser Ser Hia Met He Val Val Ser He Thr Tyr 

115 120 125 

GOG AGC TGT ATT TTC ATC TAC ATC AAA CCT TCA GCG AAG GAA GGG GTA 430 
Glv Ser Cye He Phe He Tyr He Lya Pro Ser Ala Lya Glu Gly Val 
130 135 140 

GCC ATC AAT AAG GTT GTA TCT GTG CTC ACA ACA TCA GTC GCC CCT TTG 478 
Ala He Asn Lya Val Val Ser Val Leu Thr Thr Ser Val Ala Pro Leu 
145 150 155 



CTC 
Leu 
160 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 



481 
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(0) TYPE: amino acid 
(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

He Cya Lye Pro Leu Hit Tyr Thr Thr He Met Aan Aan Arg Val Cya 

15 10 15 

Thr Val Leu Val Leu Ser Cys Trp Phe Ala Gly Leu Leu He He Leu 

20 25 30 

Pro Pro Leu Gly Hia Gly Leu Gin Leu Glu Phe Cys Aap Ser Aan Val 
35 40 45 

He Aap Hia Phe Gly Cya Aap Ala Ser Pro He Leu Gin He Thr Cya 

50 55 60 

Ser Aap Thr Val Phe He Glu Lye lie Val Leu Ala Phe Ala He Leu 
65 70 75 80 

Thr Leu He He Thr Leu Val Cys Val Val Leu Ser Tyr Thr Tyr He 

85 90 95 

He Lye Thr He Leu Lye Phe Pro Ser Ala Gin Gin Arg Lye Lye Ala 

100 105 110 

Phe Ser Thr Cya Ser Ser Hie Met He Val Val Ser He Thr Tyr Gly 
115 120 125 

Ser Cya He Phe He Tyr He Lya Pro Ser Ala Lye Glu Gly Val Ala 
130 135 140 

He Aan Lya Val Val Ser Val Leu Thr Thr Ser Val Ala Pro Leu Leu 

145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 base pair a 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : aingle 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL : YES 

(iv) ANTI-SENSE; NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: J8 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 2.. 481 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOt21i 

C ATC TGC CAC CCC CTC CAC TAC TCT CTT CTC ATG AGT CCT CAC AAC 46 
He Cyi His pro Leu Hii Tyr Ser Leu Leu Hat Ser Pro Asp Asn 

15 10 15 

TCT CCT CCT CTC CTA ACA CTC TCC TCG GTG ACA CGC CTG CCC ACG CCC 94 
Cys Ala Ala Leu Val Thr Val Ser Trp Val Thr Cly Val Gly Thr Gly 

20 25 30 

TTC CTC CCT TCC CTC CTC ATT TCT AAG TTG CAC TTC TGT GGC CCC AAC 142 
Phs Leu Pro Ser Leu Leu lie Ser Lys Leu Asp Phe Cys Gly Pro Asn 

35 40 45 

CGC ATC AAC CAT TTC TTC TGT CAC CTC CCT CCA TTA ATC CAG CTG TCC 190 
Arg lie Aan His Phe Phe Cys Asp Leu Pro Pro Leu lie Gin Leu Ser 
50 55 60 

TGC TCC AGC GTC TTT CTG ACA GAA ATG GCC ATC TTT GTC CTG TCC ATC 238 
Cys Ser Ser Val Phe Val Thr Glu Met Ala lie phe Val Leu Ser He 
65 70 75 

GCT GTC CTC TGC ATC TGT TTC CTC CTA ACC CNN NNN TCC TAC ATT TTC 286 
Ala Val Leu Cys He Cys Phe Leu Leu Thr Xaa Xaa Ser Tyr He Phe 
80 85 90 95 

ATA CTG TCC TCC ATT CTG AG A ATC CCT TCC ACT ACC GGC AGG ATG AAG 334 
He Val Ser Ser He Leu Arg He Pro Ser Thr Thr Gly Arg Met Lys 

100 105 110 

ACA TTT TCT ACA TGT GGC TCC CAC CTG GCC GTC GTC ACC ATC TAC TAT 382 
Thr Phe Ser Thr Cys Gly Ser His Leu Als Val Val Thr lie Tyr Tyr 

115 120 125 

GGG ACC ATG ATC TCC ATG TAT GTC GGC CCA AAT GCC CAT CTG TCC CCC 430 

Gly Thr Het He Ser Met Tyr Val Gly Pro Asn Ala His Leu Ser Pro 

130 135 140 

GAG CTC AAC AAG GTC ATT TCT CTC TTC TAC ACT CTG ATC ACC CCA CTA 478 
Glu Leu Asn Lys Val He Ssr Val Phe Tyr Thr Val He Thr Pro Leu 

145 150 155 



CTG 
Leu 
160 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 
(DJ TOPOLOG*: linear 

(ii) MOLECULE TYPE; protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

He CyB His Pro Leu His Tyr Ser Leu Leu Met Ser Pro Asp Aen Cys 
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15 10 15 

Ala Ala Leu Val Thr Val Ser Trp Val Thr Cly Val Gly Thr Cly Pha 

20 25 30 

Leu Pro Set Leu Leu lie Ser Lye Leu Asp Phe Cys Gly Pro Asn Arg 
35 40 45 

lie Aan His Phe Phe Cy» Asp Leu Pro Pro Leu lie Gin Leu Ser Cys 
50 55 60 

Ser Ser Val Phe Val Thr Clu Met Ala lie Phe Val Leu Ser He Ala 
65 70 75 80 

Val Leu Cys He Cye Phe Leu Leu Thr Xaa Xaa Ser Tyr He Phe He 

85 90 < 95 

Val Ser Ser He Leu Arg He Pro Ser Thr Thr Gly Arg Met Lye Thr 

100 105 110 

Phe Ser Thr Cys Gly Ser Hie Leu Ala Val Val Thr He Tyr Tyr Gly 
115 120 125 

Thr Met lie Ser Met Tyr Val Gly Pro Asn Ala Hia Leu Ser Pro Glu 
130 135 140 

Leu Asn Lye Val He Ser Val Phe Tyr Thr Val He Thr Pro Leu Leu 
145 150 155 ISO 

(2) INFORMATION FOR SEQ ID NO: 23; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 base pairs 

(B) TYPE: nucleic acid 

(C) STRAWDEDNESS: single 
<D} TOPOLOGY : linear 

<ii} MOLECULE TYPE: protein 

(iii) HYPOTHETICAL t YES 

(iv) ANTI-SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(?) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: Jll 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(8) LOCATION: 2. .646 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

N GTC TGC TTC TCC TCC ACC ACT GTC CCC AAG GTA CTG GCT AAC CAC 46 

Val Cys Phe Ser Ser Thr Thr Val Pro Lya Val Leu Ala Asn His 
15 10 15 
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ATA CTC ACT AOT CAO GCC ATT TCC TTC TCT COO TCT CTA ACT CAG CTC 
lis Leu Ser Ser GLn Ala II* Ser Pho Ser Gly Cys Lou Thr Gin Lou 

20 25 30 



94 



TAT TTT CTC TGT CTG TCT CTG AAT ATG CAC AAT TTC CTC CTG GOT CTC 142 
Tyr Phe Lou Cys Val Sor Val Aan Hot Asp Aan Pho Lou Lou Ala Val 

35 40 45 

ATG GCC TAT GAC AG A TTT CTG GCC ATA TGC CAC CCT TTG TAC TAC ACA 190 
Met Ala Tyr Aep Arg pho Val Ala Ilo Cy« Hi* Pro Lo u xyr Tyr Thr 
50 55 60 

ACA AAG ATG ACC CAC CAG CTC TGT CTC TTG CTG CTG TCT GGA TCA NNN 238 
Thr Lys Met Thr His Gin Lou Cys Val Lou Lou val Sor Gly ser Xaa 

65 70 75 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 286 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa xaa 

80 85 90 95 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 334 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 HO 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 382 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

115 120 125 

NNN NNN NNN NNN NNN NNN NNT GTG ATC ATG CTC ACC CCA TTT GTC TGC 430 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Ilo Hot val Thr Pro Phe val Cys 

130 135 140 

ATC CTC ATC TCT TAC ATC TAC ATC ACC AAT GCA GTC CTC AG A CTC TCA 478 
He Leu Ilo Sor Tyr He Tyr Ilo Thr Aan Ala Val Leu Arg Val Sor 
145 150 155 

TCC TTT ACC CCA CCA TGC AAA GCC TTC TCC ACC TGT GGC TCA CAC CTG 526 
ser Phe Arg Gly Gly Trp Lya Ala Pho Sor Thr Cys Gly Ser His Lou 
160 165 170 175 

GCT GTG GTC TGC CTC TTC TAT GGC ACC ATC ATT GCT CTG TAT TTC AAT 574 
Ala Val Val Cys Lou Pho Tyr Gly Thr Ilo Ilo Ala Val Tyr Pho Asn 

180 185 190 

CCT GTA TCT TCC CAT TCA TCT GAG AAG GAC ACT GCA CCA ACT CTC CTA 622 

Pro Val Ser Ser His Ser ser Glu Lys Asp Thr Ala Ala Thr Val Leu 

195 200 205 

TAC ACA GTG GTG ACT CCC ATG TTG 646 
Tyr Thr Val Val Thr Pro Met Leu 
210 215 

<2) X N FORMAT I ON FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 
(8) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID KOt24t 

Val Cys Phe Ser Ser Thr Thr Val Pro Lya Val Leu Ala Aan His lie 

15 10 15 

Leu Ser Ser Gin Ale lie Ser Phe Ser Cly Cya Leu Thr Gin Leu Tyr 

20 25 30 

Phe Leu Cye Val Ser Val Aan Met A*p Aan Phe Leu Leu Ale Vel Met 

35 40 45 

Ala Tyr Asp Arg Phe Val Ala He Cye Hia Pro Leu Tyr Tyr Thr Thr 

50 55 60 

Lys Met Thr Hia Gin Leu Cya Val Leu Leu Val Ser Gly Ser Xaa Xaa 

65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Val He Met Val Thr Pro Phe Val Cya He 

130 135 140 

Leu lie Ser Tyr He Tyr He Thr Aan Ala Val Leu Arg Val Ser Ser 
145 150 155 160 

Phe Arg Gly Gly Trp Lya Ala Phe Ser Thr Cya Gly Ser Hia Leu Ala 

165 170 175 

Val Val Cya Leu Phe Tyr Gly Thr He He Ala Val Tyr Phe Aan Pro 

180 185 190 

Val Ser Ser Hia Ser ser Glu Lya Aap Thr Ala Ala Thr Val Leu Tyr 

195 200 205 

Thr Val Val Thr Pro Met Leu 

210 215 

(2) INFORMATION FOR SKQ ID NO: 25: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 base pair a 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE; olfactory epithelium- 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE : J14 

(ixj FEATURE: 

(A) NAME/KEY: CDS 
<B) LOCATIONi 2.. 646 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO J 25 1 

T GTC TGC TTC TCC TCC ACC ACT GTC CCC AAG GTA CTG GCT AAC CAC 46 
Val Cys Phe Ser Ser Thr Thr Val Pro Lys Val Leu Ala Asn HI* 

15 10 is 

ATA CTC ACT ACT CAC GCC ATT TCC TTC TCT GGG TGT CTA ACT CAG CTG 94 
Il» Leu Ser Ser Gin Ala lie Ser Phe Ser Gly Cys Leu Thr Gin Leu 

20 25 30 

TAT TTT CTC TGT GTG TCT GTG AAT ATG GAC AAT TTC CTG CTG GCT GTG 142 
Tyr Phe Leu Cys Val Ser Val Aba Met Asp Asn Phe Leu Leu Ala Val 

35 40 45 

ATG GCC TAT GAC AGA TTT GTG GCC ATA TGC CAC CCT TTG TAC TAC ACA 190 
Met Ala Tyr Asp Arg Phe Val Ala He Cys His Pro Leu Tyr Tyr Thr 
50 55 60 

ACA CCG ATG ACC CAC CAG CTC TGT GTC TTG CTG CTC TCT GGA TCA NNN 238 
Thr Pro Met Thr His Gin Leu Cys Val Leu Leu Val Ser Gly Ser Xaa 

65 70 75 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 286 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

80 85 90 95 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 334 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 382 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

115 120 125 

NNN NNN NNN NNN NNN NNN NNT GTG ATC ATG GTC ACC CCA TTT GTC TGC 430 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val He Met Val Thr Pro Phe Val Cys 
130 135 140 

ATC CTC ATC TCT TAC ATC TAC ATC ACC AAT GCA GTC CTC AGA GTC TCA 478 
He Leu He Ser Tyr He Tyr He Thr . Asn Ala Val Leu Arg Val Ser 
145 150 155 

TCC TTT AGG GGA GGA TGG AAA GCC TTC TCC ACC TGT GGC TCA CAC CTG 526 
Ser Phe Arg Gly Gly Trp Lye Ala Phe Ser Thr Cys Gly Ser His Leu 
160 165 170 175 

GCT GTG GTC TGC CTC TTC TAT GGC ACC ATC ATT GCT GTG TAT TTC AAT 574 
Ala Val Val Cys Leu Phe Tyr Gly Thr He He Ala Val Tyr Phe Asn 

180 1B5 190 

CCT GTA TCT TCC CAT TCA TCT GAG AAG GAC ACT GCA GCA ACT GTG CTA 622 
Pro Val Ser Ser His Ser Ser Glu Lys Asp Thr Ala Ala Thr Val Leu 

195 200 205 
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TAC ACA GTG GTG ACT CCC ATG TTG $45 
Tyr Thr Val Val Thr Pro Ret Leu 
210 215 

(2) INFORMATION FOR SIQ 10 NOi26i 

(I) SEQUENCE CHARACTERISTICS* 

(A) LENGTH 1 215 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY 1 linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION t SEQ 10 NOt26: 

Val Cys Phe Ser Ser Thr Thr Val Pro Lys Val Leu Ala Aen His lie 

15 10 15 

Lou Ser Ser Gin Ala lie Ser Phe Ser Gly Cys Leu Thr Gin Leu Tyr 

20 25 30 

Phe Leu Cy» Val Ser Val Asn Met Asp Asn Phe Leu Leu Ala Val Met 
35 40 45 

Ala Tyr Asp Arg Phe Val Ala lie Cy» Hie Pro Leu Tyr Tyr Thr Thr 

50 55 60 

Pro Met Thr Hie Gin Leu Cys Val Leu Leu Val Ser Gly Ser Xaa Xaa 

65 70 75 8Q 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 

Xaa xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Val He Met Val Thr Pro Phe Val Cys lie 

130 135 140 

Leu He Ser Tyr He Tyr He Thr Asn Ala Val Leu Arg Val Ser Ser 

145 ISO 155 160 

Phe Arg Gly Gly Trp Lys Ala Phe Ser Thr Cye Gly Ser Hia Leu Ala 

165 170 175 

V&l Val Cys Leu Phe Tyr Gly Thr He He Ala Val Tyr Phe Aen Pro 

180 185 190 

Val Ser Ser Hie Ser Ser Glu Lya Asp Thr Ala Ala Thr Val Leu Tyr 
195 200 205 

Thr Val Val Thr Pro Met Leu 
210 215 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH i 461 base pair* 

(B) TYPE: nucleic acid 

(C) STRANDED NESS i single 

(D) TOPOLOGY t linear 

(11) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL t YES 
(iv) ANTI-SENSE t NO 

(vi) ORIGINAL SOURCE l 

(A) ORGANISM i rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(T) TISSUE TYPE i olfactory epithelium 

(vii) IMMEDIATE SOURCE i 
(B) CLONE: J15 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 481 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
T ATC TGC AAC CCT CTG CGC TAC CCA GTG CTC ATG AGO CGC CCC CTC 46 

lie Cys Aan Pro Leu Arg Tyr Pro Val Leu Met Ser Gly Arg Val 

15 10 15 

TGC CTC CTC ATC CTC GTG CCC TCC TCC TTC GGA CCA TCC CTC AAC CCC 94 
Cys Leu Leu Met Val Val Ala Ser Trp Leu Cly Gly Ser Leu Aan Ala 

20 25 30 

TCC ATT CAG ACT TCT CTC ACC CTT CAG TTC CCC TAC TCT GGA TCA CGC 142 
Ser lie Cln Thr Ser Leu Thr Leu Cln Phe Pro Tyr Cys Gly Ser Arg 

35 40 45 

AAC ATC TCC CAC TTC TTC TGT GAG GTC CCC TCC CTG CTG ANN NTG CCC 190 
Lye lie Ser Hie Phe Phe Cys Glu Val Pro Ser Leu Leu Xaa Xaa Ala 

50 55 60 

TGT GCA CAC ACT GAA GCC TAT GAG CAC CTA CTA TTT GTC ACA CCC CTG 238 
Cys Ala Asp Thr Glu Ala Tyr Glu Gin Val Leu Phe Val Thr Gly Val 

65 70 75 

GTG GTC CTC CTG GTC CCC ATT ACA TTC ATT ACT GCC TCT TAT CCC CTC 286 
Val Val Leu Leu Val Pro He Thr Phe He Thr Ala Ser Tyr Ala Leu 
60 85 90 95 

ATC CTC GCT CCT GTG CTC CGA ATG CAC TCT GCC GAG GCC ACT CAG AAC 334 
He Leu Ala Ala Val Leu Arg Met His Ser Ala Glu Gly Ser Gin Lys 

100 10S 110 

GCC CTA GCC ACA TGC TCC TCT CAC CTG ACA GTC GTC AAT CTC TTC TAT 382 
Ala Leu Ala Thr Cye Ser Ser His Leu Thr Val Val Asn Leu Phe Tyr 

115 120 125 

CGG CCC CTT GTC TAC ACC TAC ATG TTA CCT GCT TCC TAT CAC TCA CCA 430 
Gly Pro Leu Val Tyr Thr Tyr Met Leu Pro Ala Ser Tyr His Ser Pro 
130 135 140 
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GGC CAA CAC GAC ATA CTA TCC GTC TTT TAC ACC CTT CTC ACA CCC ATG 478 
Gly Gin Asp Atp lie Val Ser Val Phe Tyr Thr Val Leu Thr Pro Met 
145 150 155 



CTT 
Leu 

160 



(2) IN FORMAT I OH FOR SEQ ID NOs26t 

(1) SEQUENCE CHARACTERISTICS » 

(A) LENGTH i 160 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY t linear 

(ii) MOLECULE TYPEt protsin 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

lie Cys Aon Pro Leu Arg Tyr Pro Val Leu Met Ser Gly Arg Val Cys 

15 10 15 

Leu Leu Met Val Val Ala Ser Trp Leu Gly Gly Ser Leu Aim Ala Ser 

20 25 30 

He Gin Thr Ser Leu Thr Leu Gin Phe Pro Tyr Cya Gly Ser Arg Ly» 
35 40 45 

He Ser His Phe Phe Cys Glu Val Pro Ser Leu Leu Xaa Xaa Ala Cya 

50 55 60 

Ala Aep Thr Glu Ala Tyr Glu Gin Val Leu Phe Val Thr Gly Val Val 

65 70 75 80 

Val Leu Leu Val Pro He Thr Phe He Thr Ala Ser Tyr Ala Leu He 

85 90 95 

Leu Ala Ala Val Leu Arg Met His Ser Ala Glu Gly Ser Gin Lys Ala 

100 105 110 

Leu Ala Thr Cys Ser Ser His Leu Thr Val Val Asn Leu Phe Tyr Gly 

115 120 125 

Pro Leu Val Tyr Thr Tyr Met Leu Pro Ala Ser Tyr His Ser Pro Gly 
130 135 140 

Gin Aep Aep He Val Ser Val Phe Tyr Thr Val Leu Thr Pro Met Leu 

145 150 155 160 



(2 ) INFORMATION FOR SEQ ID NO:29: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 base pairs 

(B) TYPE: nucleic acid 

(C) STRAWDEDNESS : single 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: YES 



461 
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(iv> ANTI-SENSIt NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISMt rat olfactory epithelium 

( B) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE i olfactory epithelium 

(vil) IMMEDIATE SOURCE: 
(B) CLONE t J16 

(ix) FEATURE i 

(A) NAME/KEY i CDS 
(S) LOCATION i 2.. 481 



(xi) SEQUENCE DESCRIPTION) SEQ ID N0:29» 

C ATC TGT AGO CCT CTT CAC TAT CCT ACC CTC ATG ACC CAG ACA CTG 46 
lie Cys Arg Pro Lou His Tyr Pro Thr Leu Met Thr Gin Thr Leu 

15 10 15 

TGT CCC AAG ATT CCC ACT GGT TGC TGG TTG GGA GGC TTG GCT GGG CCA 94 
Cys Ala Lye lie Ala Thr Gly Cys Trp Leu Gly Gly Leu Ala Gly Pro 

20 25 30 

GTG GTA GAA ATT TCC TTG GTG TCT CGT CTC CTT TTT TGT GGC CCC AAT 142 
Val Val Glu lie Ser Leu Val Ser Arg Leu Leu Phe Cys Gly Pro Aen 

35 40 45 

CAC ATT CAA CAC ATC TTT TGT GAT TTC CCA CCT GTG CTG AGC TTG GCT 190 

Hie He Gin Hit He Phe Cys Asp Phe Pro Pro Val Leu Ser Leu Ala 

50 55 60 

TGT ACT GAT ACA TCA GTG AAT GTC CTG GTA GAT TTT ATT ATA AAC CTC 238 
Cys Thr Asp Thr Ser Val Asn Val Leu Val Asp Phe He He Asn Leu 

65 70 - 75 

TGC AAG ATC CTG GCC ACC TTC CTG CTG ATC CTG AGC TCC TAC TTG CAG 286 
Cys Lys He Leu Ala Thr Phe Leu Leu He Leu Ser Ser Tyr Leu Gin 
60 85 90 95 

ATA ATC CGC ACA GTG CTC AAG ATT CCT TCA GCT GGA GGC AAG AAG AAA 334 
He He Arg Thr Val Leu Lys He Pro Ser Ala Ala Gly Lye Lys Lys 

100 105 110 

CCA TTC TCC ACT TGT GCC TCC CAT CTC ACT GTG GTT CTC ATC TTC TAT 382 
Ala Phe Ser Thr Cys Ala Ser His Leu Thr Val Val Leu He Phe Tyr 

115 120 125 

GGG AGC ATC CTT TTC ATG TAT GTG CGG CTG AAG AAG ACT TAC TCC CTT 430 
Gly Ser He Leu Phe Met Tyr Val Arg Leu Lys Lys Thr Tyr Ser Leu 
130 135 140 

GAC TAC GAC AGA GCC TTG GCA GTA GTC TAC TCC GTG GTT ACC CCT TTC 478 
Asp Tyr Aap Arg Ala Leu Ala Val Val Tyr Ser Val Val Thr Pro Phe 
145 150 155 



CTC 
Leu 

160 



461 



(2) INFORMATION FOR SEQ ID NO: 30: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 160 amino acid* 

(B) TYPE: amino acid 
(D) TOPOLOGY j linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:30: 

lie Cy» Arg Pro Leu Hi* Tyr Pro Thr Leu Met Thr Gin Thr Leu Cys 

15 10 15 

Ala Lye lie Ala Thr Cly Cys Trp Leu Gly Gly Leu Ala Gly Pro Val 

20 25 30 

Val Glu lie Ser Leu Val Ser Arg Leu Leu Phe Cys Gly Pro Asn His 

35 40 45 

lie Gin Hi* lie Phe eye Aep Phe Pro Pro Val Leu Ser Leu Ala Cys 

50 55 60 

Thr Aep Thr Ser Val Asn Val Leu Val Asp Phe lie lie Aan Leu Cys 

65 70 75 80 

Lys He Leu Ala Thr Phe Leu Leu lie Leu Ser Ser Tyr Leu Gin He 

65 90 95 

He Arg Thr Val Leu Lye He Pro Ser Ala Ala Gly Lys Lys Lys Ala 

100 105 110 

Phe Ser Thr Cys Ala Ser His Leu Thr Val Val Leu He Phe Tyr Gly 

115 120 125 

Ser He Leu Phe Met Tyr Val Arg Leu Lys Lys Thr Tyr Ser Leu Asp 

130 135 140 

Tyr Aep Arg Ala Leu Ala Val Val Tyr Ser Val Val Thr Pro Phe Leu 

145 150 155 160 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4B1 base pairs 

(B) TYPEt nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO 

(vij ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: J17 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

( B ) LOCATIONi 2, .481 

(xi) SEQUENCE DESCRIPTION: SKQ ID NO: 31 J 

A ATC TGC AAC CCA CTC CTT TAT TCC ACC AAA ATG TCC ACA CAA CTC 46 
lie Cys A»n Pro Leu Leu Tyr Ser Thr Lys Met Ser Thr Gin Val 

15 10 15 

TGT ATC CAC TTC CTT CCA CGA TCT TAT ATA CGG CCT TTT CTT AAT ACT 94 
Cys lie Gin Leu Val Ala Cly Ser Tyr lie Gly Gly Phe Leu Asn Thr 

20 25 30 



TGC CTC ATC ATC TTT TAC TTT TTC TCT TTT CTC TTC TGT GGC CCA AAT 142 
Cys Leu lie Met Phe Tyr Phe Phe Ser Phe Lau Phe Cys Gly Pro Aen 

35 40 45 

ATA GTT GAT CAT TTT TTC TGT GAT TTT GCT CCT TTH HTG GAA CTT TCG 190 
He Val Asp Hie Phe Phe Cys Asp Phe Ala Pro Xaa Xaa Glu Leu Ser 
50 55 60 

TGC TCT GAT GTC ACT GTC TCT GTA GTT GTT ATG TCA TTT TCT GCT GGC 238 
Cys Ser Asp Val Ser Val Ser Val Val Val Met Ser Phe Ser Ala Gly 
65 70 75 

TCA GTT ACT ATG ATC ACA GTC TTT ATC ATA GCC ATC TCC TAT TCT TAC 286 
Ser Val Thr Met He Thr Val Phe lie lie Ala He Ser Tyr Ser Tyr 
80 85 90 95 

ATC CTC ATC ACC ATC CTC AAG ATG TCC TCA ACT GAG GGC CCT CAC AAG 334 
He Leu He Thr He Leu Lye Met Ser Ser Thr Glu Gly Arg His Lys 

100 105 110 

GCT TTC TCC ACA TGT ACC TCC CAC CTC ACT GCA GTC ACT CTC TAC TAT 382 
Ala Phe Ser Thr Cys Thr Ser His Leu Thr Ala Val Thr Leu Tyr Tyr 

115 120 125 

GGC ACC ATT ACC TTC ATT TAT GTC ATG CCC AAG TCC ACA TAC TCT ACA 430 
Gly Thr He Thr Phe He Tyr Val Met pro Lys Ser Thr Tyr Ser Thr 

130 135 140 

GAC CAG AAC AAG GTG CTG TCT GTG TTT TAC ATG GTG GTG ATC CCA ATG 478 
Asp Gin Asn Lys Val Val Ser Val Phe Tyr Met Val Val He Pro Met 

145 150 155 



TTG 
Leu 
160 



<2J INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 



481 
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Ile Cys Aon Pro Leu Leu Tyr Ser Thr Lys Met Ser Thx Gin Val Cy» 

15 10 15 

lie Gin Leu Val Al4 Gly Ser tyr lis Cly Gly Phe Leu Aan Thr Cys 

20 25 30 

Leu He Met Phe Tyr Phe Phe Ser Phe Leu Phe Cy» Gly Pro Aan He 

35 40 45 

Val Asp His Phe Phe Cys Asp Phe Alt Pro Xaa Xaa Glu Leu Ser Cys 

SO 55 60 

Ser Asp Val Ser Val Ser Val Val Val Met Ser Phe Ser Ala Gly Ser 

65 70 75 80 

Val Thr Met lis Thr Val Phe He lis Ala lis Ser Tyr Ser Tyr lis 

85 90 95 

Leu He Thr He Leu Lys Met Ser Ser Thr Glu Gly Arg His Lys Ala 

100 105 110 

Phe Ser Thr Cys Thr Ser His Leu Thr Ala Val Thr Leu Tyr Tyr Gly 
115 120 125 

Thr He Thr Phe He Tyr Val Met Pro Lys Ser Thr Tyr Ser Thr Asp 
130 135 140 



Gin Asn Lys Val Val Ser Val Phe Tyr Met Val Val He Pro Met Leu 
145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

<vii) IMMEDIATE SOURCE: 
<B) CLONE: J19 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 479 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

T ATC TGC CAC CCT CTG AAG TAC ACA GTT ATC ATG AAT CAC TAT TTT 46 
lie Cye Hie Pro Leu Lyo Tyr Thr Val He Met Asn His Tyr Phe 
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10 



15 



TGT GTG ATG CTG CTC CTC TTC TCT GTG TTC GTT AGC ATT GCA CAT GCG 
Cye Val Met Leu Lau Leu Phe Ser Val Phe Val Ser lie Ala Hi* Ala 

20 25 30 



94 



TTG TTC CAC ATT TTA ATG GTG TTG ATA CTG ACT TTC AGC ACA AAA ACT 
Leu Phe His lie Leu Met Val Leu lie Leu Thr Phe Ser Thr Lye Thr 

35 40 45 



142 



GAA ATC CCT CAC TTT TTC TGT GAG CTG GCT CAT ATC ATC AAA CTT ACC 
Glu lie Pro Hie Phe Phe Cye Glu Leu Ala Hie He He Lye Leu Thr 

50 55 60 

TGT TCC GAT AAT TTT ATC AAC TAT CTG CTG ATA TAC ACA GAG TCT GTC 
Cye Ser Asp Aen Phe He Aon Tyr Leu Leu He Tyr Thr Glu Ser Val 

65 70 75 

TTA TTT TTT GGT GTT CAT ATT GTA GGG ATC ATT TTG TCT TAT ATT TAC 
Leu Phe Phe Gly Val His lie Val Gly He He Leu Ser Tyr He Tyr 
80 85 90 95 



190 



238 



286 



ACT GTA TCC TCA GTT TTA AGA ATG TCA TTA TTG GGA GGA ATG TAT AAA 
Thr Val Ser Ser Val Leu Arg Met Ser Leu Leu Gly Gly Met Tyr Lys 

100 105 no 

GCC TTT TCA ACA TGT GGA TCT CAT TTG TCG GTT GTC TCT GTT TTA TGG 
Ala Phe Ser Thr Cye Gly Ser Hie Leu Ser Val Val Ser Val Leu Trp 

115 120 125 

CAC AGG TTT TGG GGT ACA CAT AAG CTC TCC ACT TAC TGA CTC TCC AAG 
Hie Arg Phe Trp Gly Thr His Lye Leu Ser Thr Tyr * Leu Ser Lye 
130 135 140 

GAA GAC TGT AGT GGC TTC AGT GAT GTA CAC TGT GGT TAC TCA GAT GCT G 
Glu ABp Cys Ser Gly Phe Ser Asp Val His Cys Gly Tyr Ser Asp Ala 
145 150 155 



334 



382 



430 



479 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

He Cys Hie Pro Leu Lys Tyr Thr Val He Met Asn His Tyr Phe Cys 
15 10 15 

Val Met Leu Leu Leu Phe Ser Val Phe Val Ser He Ala His Ala Leu 

20 25 30 

Phe HiB He Leu Met Val Leu He Leu Thr Phe Ser Thr Lys Thr Glu 
35 40 45 

He Pro His Phe Phe Cys Glu Leu Ala His He He Lys Leu Thr Cys 
50 55 60 



SUBSTITUTE SHEET 



WO 92/17585 



PCI7US92/02741 



-83- 



S *5 A * P A *0 ^ ^ I1# ^ S%f V * 1 ^ 

Phe Phe Gly Vtl Hie lie Val Cly H, llm ^ u Sar Tyr Il % thr 

85 90 | 5 

Val Ser Ser Val Leu Arg Met Ser Leu Leu oi y ciy het Tyr Ly* Ma 

100 105 HQ 1 

Phe Ser Thr Cye Qly Ser HU Uu Ser Val Val ser Val Leu Trp HU 

115 120 125 

Arg Phe Trp Cly Thr Hie Ly« Leu Sec Thr Tyr • Leu ser hyn Glu 

130 135 140 

Asp Cya Ser Qly Phe Bar Asp Val Hi* Cya Oly Tyr Ser Asp Ala 

145 150 155 

(2) INFORMATION FOR SEQ ID NO: 35: 

<i> SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 481 base pairs 
f B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 
(8) STRAIN: Srpague-Davley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: J20 

(ix) FEATURE: 

(A) NAME /KEY: CDS 
<B) LOCATION: 2.. 481 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

A ATC TCC TAC CCA CTG ACG TAC CTT CTC ATC ATC AGC TGC GTG GTG 46 
I la Cya Tyr Pro Leu Arg Tyr Leu Leu lie Met Ser Trp Val Val 

15 10 15 

TGC ACA GCA CTG TCC GTG CCA ATC TGC CTC ATA CGC TTT TGT GCC TCC 94 
Cys Thr Ala Leu Ser Val Ala He Trp Val He Gly Phe Cya Ala Ser 

20 25 30 

GTT ATA CCT CTC TGC TTC ACG ATC CTC CCA CTC TGT GGT CCT TAC GTC 142 
Val He Pro Leu Cy» Phe Thr He Leu Pro Leu Cya Gly Pro Tyr Val 

35 40 45 

GTT GAT TAT CTT TTC TGC GAG CTC CCC ATC CTT CTG CAC CTG TTC TGC 190 
Val Aap Tyr Leu Phe Cya Glu Leu Pro He Leu Leu Hie Leu Phe Cya 

50 55 60 

ACA CAT ACA TCT CTC CTC GAG KNN NNN NNN NNN NNN NNN NNN NNN NNN 238 
Thr Asp Thr Ser Leu Leu Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 75 
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NNN NKK HUH KNN NNN CCC TTC CTC CTG ATT CTT CTC TCC TAC CTT CGC 286 
Xaa Xaa Xaa Xaa Xaa Pro Phe Leu Leu He Val Leu Ser Tyr Leu Arg 
80 85 90 95 

ATC CTC GTC CCT CTC ATA ACA ATA CAC TCA GCT GAG GGC AG A AAA AAG 334 
He Leu Val Ala Val He Arg He Aap Ser Ala Glu Cly Arg Lys Lys 

100 105 110 

CCC TTT TCA ACT TOT GCT TCA CAC TTG GCT GTG GTC ACC ATC TAC TAT 362 
Ala Phe Ser Thr Cys Ala Sar His Lou Ala Val Val Thr He Tyr Tyr 

115 120 125 

GGA ACA GOG CTG ATC AGG TAC TTG AGG CCC AAG TCC CTT TAT TCC GCT 430 
Gly Thr Gly Lou I la Arg Tyr Leu Arg Pro Lys Ser Leu Tyr Ser Ala 
130 135 140 

GAG GGA GAC AGA CTG ATC TCT GTG TTC TAT GCA GTC ATT GGC CCT GCA 478 
Glu Gly Asp Arg Leu He Ser Val Phe Tyr Ala Val He Gly Pro Ala 
145 150 155 



CTG 
Leu 
160 

(2) INFORMATION FOR SEQ ID NO:36: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

He Cya Tyr Pro Leu Arg Tyr Leu Leu He Met Ser Trp Val Val Cys 

15 10 IS 

Thr Ala Leu Ser Val Ala He Trp Val He Gly Phe Cys Ala Ser Val 

20 25 30 

He Pro Leu Cyu Phe Thr He Leu Pro Leu Cya Gly Pro Tyr Val Val 

35 40 45 

Asp Tyr Leu Phe Cya Glu Leu Pro He Leu Leu Hia Leu Phe Cya Thr 

50 55 60 

Asp Thr Ser Leu Leu Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 75 SO 

Xaa Xaa Xaa Xaa Pro Phe Leu Leu He Val Leu Ser Tyr Leu Arg He 

85 90 95 

Leu Val Ala Val He Arg He Aap Ser Ala Glu Gly Arg Lya Lys Ala 

100 105 HO 

Phe Ser Thr Cya Ala Ser Hia Leu Ala Val Val Thr He Tyr Tyr Gly 
115 120 125 

Thr Gly Leu He Arg Tyr Leu Arg Pro Lys ser Leu Tyr ser Ala Glu 
130 135 140 

Gly Aep Arg Leu He Ser Val Phe Tyr Ala Val He Gly Pro Ala Leu 
145 150 155 160 



481 
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What is claimed is: 

1. An isolated nucleic acid molecule encoding an odorant 
receptor. 

5 

2. An isolated DNA of claim 1* 

3. An isolated cDKA of claim 2. 

10 4. An isolated cDNA of claim 3 wherein the sequence of the 

cDNA has the nucleotide sequence shown in Figure 9. 

5. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 10. 

15 

6. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 11. 

7. An isolated cDNA of claim 3 wherein the sequence of the 
20 cDNA has the nucleotide sequence shown in Figure 12. 

8. An isolated cOKA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 13. 

25 9. An isolated cDNA of claim 3 wherein the sequence of the 

cDNA has the nucleotide sequence shown in Figure 14. 

10. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 15. 

30 

11. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 16. 

12. An isolated cDNA of claim 3 wherein the sequence of the 
35 cDNA has the nucleotide sequence shown in Figure 17. 
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13. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 18. 

14. An isolated cDKA of claim 3 wherein the sequence of the 
5 cDNA has the nucleotide sequence shown in Figure 19. 

15. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 20, 

10 16. An isolated cDNA of claim 3 wherein the sequence of the 

cDNA has the nucleotide sequence shown in Figure 21. 

* 

17. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 22. 

15 

18. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 23. 

19. An isolated cDNA of claim 3 wherein the sequence of the 
20 cDNA has the nucleotide sequence shown in Figure 24. 

20. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 25. 

2 5 21. An isolated cDNA of claim 3 wherein the sequence of the 

cDNA has the nucleotide sequence shown in Figure 26. 

22. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 27. 

30 

23. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 28. 

24. An isolated cDNA of claim 3 wherein the sequence of the 
35 cDNA has the nucleotide sequence shown in Figure 29. 



SUBSTITUTE SHEET 



WO 92/17585 PCT/US92/02741 

-87- 



25. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 30. 

26. An isolated cDNA of claim 3 wherein the sequence of the 
5 cDNA has the nucleotide sequence shown in Figure 31. 

27. An isolated cDNA of claim 3 encoding an insect odorant 
receptor. 

10 28. An isolated cDNA of claim 3 encoding a vertebrate 

odorant receptor. 

29. An isolated cDNA of claim 3 encoding a fish odorant 
receptor. 

15 

30. An isolated cDNA of claim 3 encoding a mammalian 
odorant receptor. 

31. An isolated cDNA of claim 30 wherein the mammalian 
20 odorant receptor is a human odorant receptor. 

32. An isolated cDNA of claim 30 wherein the mammalian 
odorant receptor is a rat, dog or mouse odorant 
receptor. 

25 

33. An expression vector comprising the cDNA of claim 3 and 
the sequence elements necessary for replication and 
expression in a suitable host. 

30 34 . An expression vector comprising the cDNA of any of 

claims 4-19 and the sequence elements necessary for 
replication and expression in a suitable host. 
35. A purified protein encoding an odorant receptor. 



35 
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A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 9. 

A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 10. 

A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 11. 

A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 12. 

A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 13. 

A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 14. 

A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 15. 

A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 16. 

A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 17. 

A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 18. 

A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 19. 

A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 20. 
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48. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 21. 

49. A purified protein of claim 35 wherein the amino acid 
5 sequence is the sequence in Figure 22. 

50. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 23. 

10 51. A purified protein of claim 35 wherein the amino acid 

sequence is the sequence in Figure 24. 

52. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 25. 

15 

53. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 26. 

54. A purified protein of claim 35 wherein the amino acid 
20 sequence is the sequence in Figure 27. 

55. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 28. 

25 56. A purified protein of claim 35 wherein the amino acid 

sequence is the sequence in Figure 29. 

57. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 31. 

30 

58. A purified protein of claim 3 5 encoding an insect 
odorant receptor. 

59. A purified protein of claim 35 encoding a vertebrate 
35 odorant receptor. 



SUBSTITUTE SHEET 



WO 92/17585 PCT/US92/02741 

-90- 



60. A purified protein of claim 35 encoding a fish odorant 
receptor* 

61. A purified protein of claim 35 encoding a mammalian 
5 odorant receptor. 

62. A purified protein of claim 61 wherein the mammalian 
odorant receptor is a human odorant receptor. 

10 63. A purified protein of claim 61 wherein the mammalian 

odorant receptor is a rat, dog or mouse odorant 
receptor ♦ 

64. A purified protein of claim 35 which has 7 
15 transmembrane regions and whose third cytoplasmic loop 

from the N-terminus is approximately 17 amino acid 
long. 

65. A method of transforming cells which comprises 
20 transfecting a host cell with a suitable expression 

vector of claim 33, 

66. A method of transforming cells which comprises 
transfecting a host cell with a suitable expression 

25 vector of claim 34. 

67. Cells transformed by the method of claim 65. 

68. Transformed cells of claim 67 wherein the cells are 
30 olfactory cells. 

69. Transformed cells of claim 67 wherein the cells are 
non-olfactory cells. 



35 
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A method of identifying a desired odorant ligand 
comprising contacting transformed non-olfactory cells 
of claim 69, expressing a known odorant receptor with 
a series of odorant ligands and determining which 
ligands bind to the receptors present on the non- 
olfactory cells. 

A method of identifying a desired odorant receptor 
comprising contacting a series of transformed non- 
olfactory cells of claim 69 with a known odorant ligand 
and determining which odorant receptor binds with the 
odorant ligand. 

A method of detecting an odor which comprises: 

a) identifying a odorant receptor which binds the 
desired odorant ligand by the method of claim 71 
and; 

b) imbedding the receptor in a membrane such that 
when the odorant ligand binds with the receptor 
identified in a) above, a detectable signal is 
produced. 

A method of claim 72 wherein the desired odorant is a 
pheromone. 

A method of claim 72 wherein the desired odorant ligand 
is the vapors emanating from cocaine, marijuana, 
heroin, hashish, or angel dust. 

A method of claim 72 wherein the desired odorant ligand 
is the vapors emanating from gasoline, natural gas or 
alcohol. 
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A method of claim 72 wherein the desired odorant ligand 
is the vapors emanating from decayed human flesh. 

A method of claim 72 wherein the desired odorant ligand 
is the vapors emanating from explosives, plastic 
explosives, firearms, or gun powder. 

A method of claim 72 wherein the desired odorant ligand 
is toxic fumes, noxious fumes or dangerous fumes. 

A method of claim 72 wherein the membrane is a cell 
membrane. 

A method of claim 72 wherein the membrane is an 
olfactory cell membrane. 

A method of claim 72 wherein the membrane is a 
synthetic membrane. 

A method of claim 72 wherein the detectable signal is 
a color change, phosporescence, or radioactivity. 

A method of quantifying the amount of an odorant ligand 
present in a sample which comprises the method of claim 
72 wherein the detectable signal is quantified. 

A method of developing fragrances which comprises 
identifying a desired odorant receptor by the method of 
claim 71 then contacting non-olfactory cells, which 
have been trans feet ed with an expression vector 
containing the cDNA of the desired odorant receptor 
such that the receptor is expressed upon the surface of 
the non-olfactory cell, with a series of compounds to 
d termine which ones bind with the receptor. 
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85. A method of identifying an odorant fingerprint which 
comprises contacting a series of cells, which have been 
transformed such that each express a known odorant 
receptor, with a desired sample and determining the 

5 type and quantity of the odorant ligands present in the 

sample. 

86. A method of identifying odorant ligands which inhibit 
the activity of a desired odorant receptor which 

10 comprises contacting the desired odorant receptor with 

a series of compounds and determining which compounds 
inhibit the odorant ligand - odorant receptor 
interaction. 

15 87. A method of identifying appetite suppressant compounds 

which comprises identifying odorant ligands by the 
method of claim 86 wherein the desired odorant receptor 
is that which is associated with the perception of 
food. 



20 



25 



88. A method of controlling appetite in a subject which 
comprises contacting the olfactory epithelium of the 
subject with the odorant ligands identified by the 
method of claim 87. 

89. A nasal spray, to control appetite comprising the 
compounds identified by the method of claim 87 in a 
suitable carrier. 



30 90. A method of trapping odors which comprises contacting 

a membrane which contains multiples of the desired 
odorant receptor, with a sample such that the desired 
odorant ligand is absorbed by the binding of the 
odorant ligand to the odorant receptor. 

35 
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An odor trap employing the method of claim 90. 

A method of controlling pest populations which 
comprises identifying odorant ligands by the method of 
claim 70 which are alarm odorant ligands and spraying 
the desired area with the identified odorant ligands. 

A method of controlling a pest population which 
comprises identifying odorant ligands by the method of 
claim 70 which interfere with the interaction between 
the odorant ligands and the odorant receptors which are 
associated with fertility. 

A method of claim 92 or 93 wherein the pest population 
is a population of insects. 

A method of claim 92 or 93 wherein the pest population 
is a population of rodents. 

A method of claim 95 wherein the population of rodents 
is a population of mice or rats. 

A method of promoting fertility which comprises 
employing the method of claim 70 to identify odorant 
ligands which interact with the odorant receptors 
associated with fertility and administering the 
identified odorant ligands to a subject. 

A method of inhibiting fertility which comprises 
employing the method of claim 70 to identifying odorant 
ligands which inhibit the interaction between the 
odorant ligands and the odorant receptors associated 
with fertility and administering the identified odorant 
ligands to a subject. 
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Figure 3 
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Figur 4A 



* 



F3 








H 


D 


S 


S 


N 


R 


T 


R 


V 


S 


E 


11 


F5 








M 


S 


S 


T 


N 


Q 


S 


S 


V 


T 


E 


11 


F6 


N 


A 


W 


S 


T 


G 


Q 


N 


L 


S 


T 


P 


G 


P 


14 


F12 






M 


E 


S 


G 


N 


S 


T 


R 


R 


F 


S 


S 


12 


13 








M 


N 


- 


- 


N 


Q 


T 


F 


I 


T 


Q 


9 


17 






N 


£ 


R 


R 


N 


H 


S 


G 


R 


V 


S 


E 


12 


18 








H 


N 






N 


K 


T 


V 


I 


T 


H 


9 


18 








N 


T 


R 


R 


N 


Q 


T 


A 


I 


S 


Q 


11 


114 








M 


T 


G 


N 


N 


Q 


T 


L 


I 


L 


E 


11 


115 








M 


T 


£ 


E 


N 


Q 


T 


V 


I 


S 


Q 


11 


F3 


P 


L 


L 


L 


G 


F 


V 


E 


N 


K 


D 


L 


Q 


P 


25 


F5 


F 


L 


L 


L 


G 


L 


S 


R 


Q 


P 


Q 


Q 


Q 


Q 


25 


F6 


F 


I 


L 


L 


G 


F 


P 


G 


P 


R 


S 


M 


R 


I 


28 


F12 


F 


F 


L 


L 


G 


F 


T 


E 


N 


P 


Q 


L 


H 


F 


26 


13 


F 


L 


L 


L 


G 


L 


P 


I 


P 


E 


E 


H 


Q 


H 


23 


17 


F 


V 


L 


L 


G 


F 


P 


A 


P 


A 


P 


L 


R 


V 


26 


18 


F 


L 


L 


L 


G 


L 


P 


I 


P 


P 


E 


H 


Q 


Q 


23 


19 


F 


F 


L 


L 


G 


L 


P 


F 


P 


P 


E 


Y 


Q 


H 


25 


114 


F 


L 


L 


L 


G 


L 


P 


I 


P 


S 


E 


Y 


H 


L 


25 


115 


F 


L 


L 


L 


F 


L 


P 


I 


P 


S 


E 


H 


Q 


H 


25 



SUBSTITUTE SHEET 



WO 92/17585 



PCT/US92/02741 



6/99 

Figur 4B 
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Figure 4C 
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FIGURE 19A 

Translated Sequence of H5.D1S 
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15 10 15 

Ala Ala Lau Val Thr Val Sar Trp Val Thr Cly Val Gly Thr Cly Pha 

20 25 3Q 

leu Pro Sar Lau Lau Ila Sar Lya Lau Aap Pha Cya Gly Pro Aan Arg 

35 40 45 

Ila Aan Hia Pha Ph* Cya Aap Lau Pro Pro Lau I la Gin Lau Sar cya 

50 55 60 

Sar Sar Val Pha Val Thr Clu Mat Ala Ila Pha Val Lau Sar Ila AU 

65 70 75 60 

val Lau Cya Ila Cya Pha Lau Lau Thr Xaa Xaa Sar Tyr Ila Pha Ila 

65 90 95 

val Sar Sar Ila Leu Arg Ila Pro Set Thr Thr Gly Arg Mat Lya Thr 

100 105 1X0 

Pha Sar Thr Cya Gly Sar Hia Lau Ala Val Val Thr Ila Tyr Tyr Cly 

115 120 125 

Thr Mat Ila Sar Mat Tyr Val Gly Pro Aan Ala Hia Lau Sar Pro Glu 

130 135 140 

Lau Aan Lya Val Ila Sar Val Pha Tyr Thr Val Ila Thr Pro Lau Lau 

14S 150 155 160 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 646 baa* paira 

(B) TYPE i nuclaic acid 

(C) STRANOEDNESS : aingl* 

(D) TOPOLOGY i linaar 

(ii) MOLECULE TYPEi protain 
(ill) HYPOTHETICAL: YES 
(iv) ANTI-SENS* t NO 

(vi) ORIGINAL SOURCE) 

(A) ORGANXSMt rat olfactory apithaliun 

( B) STRAIN i Srpagua-Dawlay rat 

(F) TISSUE TYPE: olfactory apithaliua 

(vii) IMMEDIATE SOURCE: 
(B) CLONE i Jll 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 646 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

N GTC TGC TTC TCC TCC ACC ACT GTC CCC AAG GTA CTG GCT AAC CAC 46 

Val Cya Phe Ser Sar Thr Thr Val Pro Lya Val Lau Ala Aan Hia 

15 10 15 
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ATA 
Ila 


CTC 
Lau 


ACT 

Smr 


AOT 
Smr 


CAO 
Gin 
20 


OCC 

Ala 


ATT 

Ila 


TCC 
Sar 


TTC 
Pha 


TCT 
Sar 

25 


GGG 
Gly 


TGT 

Cya 


CTA 

Lau 


ACT 
Thr 


CAG 

Gin 
30 


CTG 

Lau 


94 


TAT 

Tyr 


TTT 
Pha 


CTC 
Lau 


TGT 
Cya 

35 


GTG 
Val 


TCT 
Smr 


GTG 

val 


AAT 
Aan 


ATC 
Mat 
40 


GAC 

Aap 


AAT 
Aan 


TTC 
Pha 


CTC 

Lau 


CTG 
Lau 
45 


GCT 

Ala 


CTG 
Val 


142 


ATG 
Mat 


CCC 
Ala 


TAT 

Tyr 
50 


GAC 
Asp 


ACA 
Arg 


TTT 
Pha 


GTG 

Val 


GCC 
Ala 
55 


ATA 
Ila 


TGC 
Cya 


CAC 
Bla 


CCT 
Pro 


TTG 
Lau 
60 


TAC 

Tyr 


TAC 

Tyr 


ACA 

Thr 


190 


AGA 

Thr 


AAG 
Lya 

65 


ATG 
Mat 


ACC 
Thr 


CAC 
Hia 


CAG 
Gin 


CTC 
Lau 

70 


TGT 
Cya 


CTC 
Val 


TTO 
Lau 


CTC 
Lau 


GTG 
Val 

75 


TCT 

Sar 


GGA 
Gly 


TCA 

sar 


NNN 

Xaa 


236 


NNN 
80 


NNN 


NNN 


NNN 


NNN 


NNN 

85 


NNN 


NNN 


NNN 


NNN 


NNN 

90 


NNN 


NNN 


NNN 


NNN 


NNN 
95 


266 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


334 










100 










105 










110 






NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


NNN 


362 








115 










120 










125 








NNN 
Xaa 


NNN 
Xaa 


NNN 
Xaa 

130 


NNN 
Xaa 


NNN 
Xaa 


NNN 

Xaa 


NNT 
Xaa 


GTG 
Val 
135 


ATC 
Ila 


ATG 
Mat 


CTC 
Val 


ACC 
Thr 


CCA 
Pro 
140 


TTT 
Pha 


GTC 
Val 


TGC 

Cya 


430 


ATC 
Ila 


CTC 
Lau 

145 


ATC 
11* 


TCT 
St 


TAC 

Tyr 


ATC 
Ila 


TAC 
Tyr 
ISO 


ATC 
Ila 


ACC 

Thr 


AAT 

Aan 


GCA 
Ala 


GTC 
Val 
155 


CTC 
Lau 


AGA 
Arg 


GTC 
Val 


TCA 

Sar 


478 


TCC 
Sar 

160 


TTT 
Pha 


AGG 
Arg 


GGA 

Gly 


GGA 

Gly 


TGC 
Trp 
165 


AAA 
Lya 


GCC 
Ala 


TTC 
Pha 


TCC 

Sar 


ACC 
Thr 
170 


TGT 
Cya 


GGC 
Gly 


TCA 
Sar 


CAC 
Hia 


CTG 
Lau 

175 


526 


OCT 

Ala 


GTG 
Val 


CTC 
Val 


TGC 
Cya 


CTC 
Lau 
180 


TTC 

Pha 


TAT 

Tyr 


GGC 
Gly 


ACC 
Thr 


ATC 

Ila 

185 


ATT 

Ila 


OCT 

Ala 


GTG 

val 


TAT 
Tyr 


TTC 
Pha 
190 


AAT 
Aan 


574 


CCT 
Pro 


GTA 
Val 


TCT 

Smr 


TCC 

Sar 

195 


CAT 

Hia 


TCA 

Sar 


TCT 

Sar 


GAG 

Glu 


AAG 

ty« 

200 


GAC 
Aap 


ACT 
Thr 


GCA 

Ala 


GCA 
Ala 


ACT 
Thr 
205 


GTG 

Val 


CTA 

Lau 


622 


TAC 

Tyr 


ACA 

Thr 


GTG 

val 

210 


GTG 

val 


ACT 

Thr 


CCC 
Pro 


ATG 
Mat 


TTG 
Lau 

215 


















646 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS! 

{A) LENGTH: 215 amino acida 
(B) TYPE: amino acid 
( D ) TOPOLOGY: linaar 

(ii> MOLECULE TYPE: protain 
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(xi) SEQUENCE DISRUPTION i 3IQ ID NOi24» 



Val 

1 


Cy. 


Pha 


sar 


Sar 

5 


Thr 


Thr 


Val 


Pro 


Lya 

10 


Val 


Lau 


Ala 


Aan 


Hia 

15 


Ila 


Lau 


Sar 


Sar 


Gin 
20 


Ala 


11* 


Sar 


Pha 


sar 

25 


Gly 


Cya 


Lau 


Thr 


Gin 
30 


Lau 


Tyr 


Pha 


Lau 


Cya 

35 


Val 


Sar 


Val 


Aan 


Mat 

40 


Aap 


Aan 


Pha 


Lau 


Lau 
45 


Ala 


Val 


Mat 


Ala 


Tyr 
SO 


Aap Arg 


Pha 


Val 


Ala 

55 


11* 


Cya 


Hia 


Pro 


Lau 

60 


Tyr 


Tyr 


Thr 


Thr 


Lya 

65 


Hat 




ma 


Gin 


L«u 

70 


Cya 


Val 


Lau 


Lau 


Val 

75 


Sar 


Gly Sar Xaa 


Xaa 

80 










85 










90 










95 










100 










105 










110 










115 










120 










125 








Xaa 


Xaa 

130 


Xa* 


Xaa 


Xaa 


Xaa 


Val 

135 


Ila 


Mat 


Val 


Thr 


Pro 
140 


Pha 


Val 


Cya 


Ila 


Lau 

145 


I la 


Sar 


Tyr 


I la 


Tyr 
ISO 


Ila 


Thr 


Aan 


Ala 


Val Lau Arg Val 

15$ 


Sar 


Sar 

160 


Ph« 


Arg 


Gly 


Gly 


Trp 
16S 


Lya 


Ala 


Pha 


Sar 


Thr 
170 


Cya Gly 


Sar 


Hia 


Lau 

175 


Ala 


val 


Val 


Cya 


Lau 

180 


Pha 


Tyr 


Gly 


Thr 


Ila 

185 


Ila 


Ala 


Val 


Tyr 


Pha 

190 


Aan 


Pro 


val 


Sar 


Sar 

195 


Hia 


Sar 


Sar 


Glu 


Lya 

200 


Aap Thr 


Ala 


Ala 


Thr 
205 


Val 


L«u 


Tyr 


Thr 


Val 

210 


Val 


Thr 


Pro 


M#t 


Lau 

215 




















(2) 


INFORMATION 


FOR 


SIQ 


ID NOi25x 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LtlfGTHi 646 basa paira 

(B) TYPft nuclaic acid 

(C) STRAND EDNESS i alngla 

(D) TOPOLOGY t llnaar 

(ii) MOLECULE TYPEi cDNA 
(iii) HYPOTHETICAL l YES 
(iv) ANTI-SENSE x NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory apithalium 

(B) STRAIN: Srpagua-Dawlay rat 

(P) TISSUE TYPE: olfactory apithalium 
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(vii) IMMEDIATE SOURCE I 
{ B) CLONKi J14 

(ix) FEATURE: 

(X) NAME/KEY I CDS 
<B) LOCATION t 2.. 646 

(xi) SEQUENCE DESCRIPTION t SEQ ID NOi25i 

T CTC TGC TTC TCC TCC ACC ACT CTC CCC AAO OTA CTC OCT AAC CAC 
Val Cya Pha Sar Sar Thr Thr Val Pro Lys Val Leu Ala Asn Hi* 

15 10 is 



ATC GCC TAT CAC AGA TTT CTC CCC ATA TCC CAC CCT TTC TAC TAC ACA 
Met Ala Tyr Asp Arg Pha Val Ala lie eye His Pro Leu Tyr Tyr Thr 
50 55 60 



46 



ATA CTC ACT ACT CAC CCC ATT TCC TTC TCT GCO TOT CTA ACT CAC CTC 94 
lis Leu Ssr Ser Gin Ala lie Sar Phe Sar Cly Cys Lau Thr Gin Lau 

20 25 30 

TAT TTT CTC TCT CTC TCT CTC AAT ATC CAC AAT TTC CTC CTC OCT CTC 142 
Tyr Pha Lau Cys Val Sar Val Asn Mat Asp Asn Pha Lau Leu Ala Val 

35 40 45 



190 



ACA CCC ATC ACC CAC CAC CTC TCT CTC TTC CTC CTC TCT CCA TCA NNN 238 
Thr Pro Mat Thr Hia Cln Lau Cys Val Lau Lau Val Smr Cly Sar Xaa 

65 70 75 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 266 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

80 B5 90 95 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 334 

s\\ JC s\^sl iC ^s aa* ^s> ^s sa> si }C 3Csa\ si 3Ce%^ sa> ales' 2Ces\e% ^Ca% ei ]C aa\ A% A as. V A A V at at V si ai 

100 105 no 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 382 

Xaa Xaa Xaa Xaa xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

115 120 125 

NNN NNN NNN NNN NNN NNN NNT CTC ATC ATC CTC ACC CCA TTT CTC TGC 430 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val I la Mat Val Thr Pro Pha Val Cya 
130 135 140 

ATC CTC ATC TCT TAC ATC TAC ATC ACC AAT CCA CTC CTC ACA CTC TCA 478 
Ila Lau Ila sar Tyr lis Tyr Ila Thr Asn Ala Val Lau Arg Val Sar 

145 150 155 

TCC TTT ACC CCA CCA TCC AAA CCC TTC TCC ACC TCT CCC TCA CAC CTC 526 
Ser Pha Arg Cly Cly Trp Lys Ala Pha Sar Thr Cys Gly Sar Hia Lau 

160 165 170 175 

CCT CTC CTC TCC CTC TTC TAT CCC ACC ATC ATT CCT CTC TAT TTC AAT 574 
Ala Val Val Cys Lau Pha Tyr Cly Thr Ila Ila Ala Val Tyr Pha Aan 

180 185 190 

CCT CTA TCT TCC CAT TCA TCT CAC AAC CAC ACT GCA CCA ACT CTC CTA 622 
Pro Val Ser Sar Hia Sar Sar Clu Lya Asp Thr Ala Ala Thr Val Lau 

195 200 205 



SUBSTITUTE SHEET 



i 

1 



WO 92/17585 



PCI7US92/02741 



-75- 



TAC ACA GTG CTC ACT CCC ATC TTC 
Tyr Thr Val Vai Thr Pro Mat Lau 

210 21S 

(2) IHFORMA7ION FOR SEQ ID NOi26( 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH I 215 Amino acid a 

(B) TYPBi amino acid 
<D) TOPOLOGY i Una at 

(11) MOLECULE TTfPEi protain 

(xl) SEQUENCE DESCRIPTION! 5SQ ID NOt26t 

Val Cya Pha Sar Sar Thr Thr Val Pro Lya Val Lau Ala Aan Hit Ha 

IS 10 15 

Lau Sar Smr Gin Ala Ila Sar Pha St Gly Cya Lau Thr Gin Lau Tyr 

20 25 30 

Pha Lau Cya Val Sar Val Aan Mat Aap Aan Pha Lau Lau Ala Val Mat 

35 40 45 

Ala Tyr Aap Arg Pha Val Ala I la Cya Hit Pro Lau Tyr Tyr Thr Thr 

50 55 60 

Pro Mat Thr Hia Gin Lau Cya Val Lau Lau Val Sar Gly Sar Xaa Xaa 

65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Val I la Mat Val Thr Pro Pha Val Cya I la 

130 135 140 

Lau I la Sar Tyr I la Tyr I la Thr Aan Ala Val Lau Arg Val Sar Sar 

14$ 150 155 160 

Pha Arg Gly Gly Trp Lya Ala Pha Sar Thr Cya Gly Sar Hia Lau Ala 

165 170 175 

val Val Cya Lau Pha Tyr Gly Thr Ila Ila Ala Val Tyr Pha Aan Pro 

180 185 190 

Val Sar Sar Hia Sar Sar Glu Lya Aap Thr Ala Ala Thr Val Lau Tyr 

195 200 205 

Thr Val val Thr Pro Mat Lau 

210 215 

(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH i 481 baaa pair a 

(B) TYPBi nuciaic acid- 
ic) STRAHDBDNESSi aingla 
(D) TOPOLOGY i linaar 

(11) MOLECULE TYPEi CDNA 

(111) HYPOTHETICAL: YES 

(Iv) ANTI -SENSE I NO 

(vi) ORIGINAL SOORCEl 

(A) ORGAN I Mi rat olfactory aplthalium 

(8) STRAIN t Srpagua-Dawlay rat 

if) TISSUE TYPEi olfactory aplthalium 

(vll) IMMEDIATE SOURCE: 
(8) CLONE: J15 

(ix) FEATURE: 

(A) NAME/KEY : CDS 

(B) LOCATION] 2.. 481 



(xl) SEQUENCE DESCRIPTION: SEC; ID NO: 27: 

T ATC TCC AAC CCT CTC CGC TAC CCA GTC CTC ATC ACC GGC CGG CTG 46 
XX« Cya Aan Pro Lau Arc, Tyr Pro Val Lau Hat Sar Gly Arg Val 

15 10 15 

TCC CTG CTC ATC GTC GTG GCC TCC TOG TTC GGA GGA TCC CTC AAC GCC 94 
Cya Lau Lau Hat Val Val Ala Sar Trp Lau Gly Gly Sar Lau Aan Ala 

20 25 30 

TCC ATT CAG ACT TCT CTC ACC CTT CAG TTC CCC TAC TOT GGA TCA CGG 142 

Sar Ila Gin Thr Sar Lau Thr 'Uu Cln Pha Pro Tyr Cya Gly Sar Arg 

35 40 45 

AAC ATC TCC CAC TTC TTC TGT GAG GTC CCC TCC CTG CTC ANN NTG GCC 190 
Lya Ila Sar Hla Pha Pha Cya Glu Val Pro Sar Lau Lau Xaa Xaa Ala 

50 55 60 

TGT CCA GAC ACT GAA GCC TAT GAG CAG GTA CTA TTT GTG ACA GGC GTG 238 
Cya Ala Aap Thr Glu Ala Tyr Glu Gin Val Lau Pha Val Thr Gly Val 

65 70 75 

CTC GTC CTC CTC CTC CCC ATT ACA TTC ATT ACT GCC TCT TAT GCC CTC 286 
Val Val Lau Lau Val Pro Ila Thr Pha Ila Thr Ala Sar Tyr Ala Lau 

80 8S 90 95 

ATC CTC GCT CCT GTG CTC CGA ATC CAC TCT GCG GAG GGG AGT CAC AAC 334 
Ila Lau Ala Ala Val Lau Arg Mat Hia Sar Ala Glu Gly Sar Cln Lya 

100 105 110 

GCC CTA CCC ACA TCC TCC TCT CAC CTC ACA CTC GTC AAT CTC TTC TAT 382 
Ala Lau Ala Thr Cya Sar Sar Hia Lau Thr Val Val Aan Lau Pha Tyr 

115 120 125 

GCG CCC CTT GTC TAC ACC TAC ATC TTA CCT GCT TCC TAT CAC TCA CCA 430 
Gly Pro Lau Val Tyr Thr Tyr Mat Lau Pro Ala Sar Tyr Hia Sar Pro 

130 135 140 
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OCC CAA GAC CAC ATA GTA TCC OTC TTT TAC ACC CTT CTC ACA CCC ATG 47B 
Gly Gin Asp Asp lis VaI S*r V*l Phs Tyr Thr Val L*u Thr Pro M«t 

145 ISO 155 

L*u 

160 

(2 J INFORMATION FOR 8CQ ZD NOt28l 

(1) SEQUXNCE CHARACTERISTICS: 

(A) LENGTH: 160 aaino acids 

(B) TYPft Amino 4cid 
(D) TOPOLOGY i line ax 

(ii) MOLECULE TYPIt protain 
(Jci) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

XI* Cys Asn Pro Vmu Arg Tyr Pro Val L*u Mat S*r Gly Arg Val Cy* 

1 5 10 15 

L*u L*u M*t VaI VaI AIa S*r Trp L*u Gly Gly S*r L*u A*n AlA Sar 

20 25 30 

II* Gin Thr S*r Lau Thr L*u Gin Ph* Pro Tyr Cy* Gly S*r Arg Ly* 
35 40 45 

II* S«r His Ph* Ph* Cya Glu VaI Pro 5*r L*u L*u Xaa Xaa AIa Cy* 

SO 55 60 

Al* Asp Thr Glu Al* Tyr Glu Gin V*l L*u Ph* V*i Thr Gly V*l V*l 

65 70 75 80 

Val L*u Lau VaI Pro II* Thr Ph* XI* Thr Al* S«r Tyr AIa L*u II* 

85 90 95 

L*u AIa AIa VaI Lau Arg M*t Hi* S*r Al* Glu Gly S*r Gin Lys Al* 

100 105 110 

Lau AIa Thr Cy* S*r 5*r Hi* L*u Thr VaI VaI Asn Lau Ph* Tyr Gly 

115 120 125 

Pro Lau V*l Tyr Thr Tyr M*t L«u Pro Al* S*r Tyr Hi* S*r Pro Gly 

130 135 140 

Gin Asp Asp II* VaI 5*r VaI Ph* Tyr Thr val L*u Thr Pro M*t L*u 

145 150 155 160 

(2) INFORMATION FOR SIQ ID NO: 29: 

<i) SEQUENCE CHARACTKRISTICS: 

(A) LENGTH: 481 b*s* pairs 

(B) TYPE: nucl*ic acid 

(C) STRANDEDNESS: singla 

(D) TOPOLOGY: lin*ar 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 
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<iv) ANTI-SENSE I NO 

(vi) RIGINAL SOURCIt 

(A) ORGANISM! rat olfactory apithaliua 

(B) STRAIN t Srpagua-Dawlay rat 

{ F) TISSUI TYPE i olfactory apithaliua 

(vli) IMMEDIATE SOURCE : 
(B) CLONIt J16 

(ix) FEATURE I 

(A) NAHK/KBYt CDS 
(8) LOCATION! 2..4B1 



(xl) SEQUENCE OISCRIPTIONt SSQ ID NOt29i 

C ATC TCT AGO CCT CTT CAC TAT CCT ACC CTC ATC ACC CAC ACA CTC 46 
I la Cya Arg Pro Lau Hia Tyr Pro Thr L«u Mat Thr Gin Thr Lau 

15 10 15 

TGT GCC AAG ATT GCC ACT GGT TGC TOO TTG GGA GGC TTG OCT GGG CCA 94 
Cya Ala Lya Ila Ala Thr Gly Cya Trp Lau Gly Gly Lau Ala Gly Pro 

20 25 30 

CTG CTA GAA ATT TCC TTG GTG TCT CGT CTC CTT TTT TGT GGC CCC AAT 142 
val val Glu Ila Sar Lau Val Sar Arg Lau L«u Pha Cya Gly Pro Aan 

35 40 45 

CAC ATT CAA CAC ATC TTT TGT GAT TTC CCA CCT CTG CTG AGC TTG OCT 190 
Hia Ila Gin Hia Ila Pha Cya Aap Pha Pro Pro Val Lau Sar Lau Ala 

50 55 60 

TGT ACT GAT ACA TCA GTG AAT CTC CTG CTA GAT TTT ATT ATA AAC CTC 238 
Cya Thr Aap Thr Sar Val Aan Val Lau Val Aap Pha Ila Ila Aan Lau 

65 70 75 

TGC AAG ATC CTG GCC ACC TTC CTG CTC ATC CTG AGC TCC TAC TTG CAG 286 
Cya Lya Ila Lau Ala Thr Pha Lau Lau Ila L«u Sar Sar Tyr Lau Gin 
80 85 90 95 

ATA ATC CCC ACA GTG CTC AAG ATT CCT TCA GCT GGA GGC AAG AAG AAA 334 

Ila Ila Arg Thr Val Lau Lya Ila Pro Sar Ala Ala Gly Lya Lya Lya 

100 105 110 

CCA TTC TOG ACT TGT GCC TCC CAT CTC ACT GTG GTT CTC ATC TTC TAT 382 

Ala Pha Sar Thr Cya Ala Sar Hia Lau Thr Val val Lau II* Pha Tyr 

115 120 125 

GGG AGC ATC CTT TTC ATC TAT GTG CGC CTG AAG AAG ACT TAC TCC CTT 430 
Gly Sar Ila Lau Pha Mat Tyr Val Arg Lau Lya Lya Thr Tyr Sar Lau 
130 135 140 

CAC TAC GAC AGA GCC TTG GGA GTA GTC TAC TCC CTC CTT ACC CCT TTC 478 

Aap Tyr Aap Arg Ala Lau Ala Val Val Tyr Sar Val Val Thr Pro Pha 

145 150 155 

481 



Lau 

160 



(2) INFORMATION FOR SEQ ID NO: 30: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH i 160 amino acida 

(B) TYPE: Amino acid 
(D) TOPOLOGY t linaar 

(ii) MOLECULE TYPE: protain 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOi30: 

Ila Cy» Arg Pro Lau Hit Tyr Pro Thr Lau Mat Thr Gin Thr Lau Cya 

15 10 15 

Ala Lya Ila Ala Thr Giy Cya Trp Lau Gly Gly Lau Ala Gly Pro Val 

20 2S 30 

Val Glu I la Sar Lau Val Sar Arg Lau L#u Pha Cya Gly Pro Aan Hla 

35 40 45 

Ila Gin Hia 11a Pha Cya Aap Pha Pro Pro Val Lau Sar Lau Ala Cya 

50 55 60 

Thr Aap Thr Sar Val Aan Val Lau Val Aap Pha Ila Ila Aan Lau Cya 

65 70 75 80 

Lya Ila Lau Ala Thr Pha Lau Lau Ila Lau Sar Sar Tyr Lau Gin Ila 

85 90 95 

Ila Arg Thr Val Lau Lya Ila Pro Sar Ala Ala Gly Lya Lya Lya Ala 

100 105 110 

Pha Sar Thr Cya Ala Sar Hia Lau Thr Val val Lau Ila Ph« Tyr Gly 

115 120 125 

Sar Ila Lau Pha Mat Tyr Val Arg Lau Lya Lya Thr Tyr Sar Lau Aap 

130 135 140 

Tyr Aap Arg Ala Lau Ala Val Val Tyr S«r Val Val Thr Pro Pha Lau 

145 150 155 160 

(2) INFORMATION TOR SEQ ID NO: 31: 

(1) SEQUENCE CHARACTERISTICS i 

(A) LENGTH j 481 baea paira 

(B) TYPBt nuclaic acid 

(C) STRANDEDNESS : aingla 
<D) TOPOLOGY: llnaar 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: YES 

<iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory apithalium 

(B) STRAIN: Srpagua-Dawlay rat 

(P) TISSUE TYPE* olfactory apithalium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: J17 
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{Lx) FEATURE i 

(A) NAME/KEY i CDS 
<B) LOCATION! 2. .481 



(xi) SEQUENCE DESCRIPTION: SEQ ID K0i31i 

A ATC TCC AAC CCA CTC CTT TAT TCC ACC AAA ATC TCC ACA CAA CTC 
He Cya Aan Pro Leu Leu Tyr Ser Thr Lye Met Ser Thr Gin Val 

15 10 15 



TTC 
Leu 

160 



(2) INFORMATION FOR SEQ ID NO: 32 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCTH: 160 amino acids 
(8) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 



46 



TGT ATC CAG TTO GTT GCA GGA TCT TAT ATA GGC GOT TTT CTT AAT ACT 94 
Cye lie Gin Leu Val Ala Gly Ser Tyr He Cly Gly phe Leu Aan Thr 

20 25 30 

TGC CTC ATC ATG TTT TAC TTT TTC TCT TTT CTC TTC TGT GOG CCA AAT 142 
Cya Leu He Met Phe Tyr Phe Phe Ser Phe Leu Phe Cye Gly Pro Aan 

35 40 45 

ATA GTT GAT CAT TTT TTC TGT GAT TTT CCT CCT TTN NTG CAA CTT TCC 190 
He Val Aap Hia Phe Pha Cya Aap Phe Ala Pro Xaa Xaa Glu Leu Ser 

50 55 60 

TGC TCT GAT GTG ACT CTC TCT CTA GTT GTT ATG TCA TTT TCT OCT GGC 238 
Cya Ser Aap Val Ser Val Ser Val Val Val Met Ser Phe Ser Ale Gly 

65 70 75 

TCA GTT ACT ATG ATC ACA GTG TTT ATC ATA GCC ATC TCC TAT TCT TAC 286 
Ser Val Thr Met He Thr Val Phe He He Ala He Ser Tyr Ser Tyr 
80 85 90 95 

ATC CTC ATC ACC ATC CTG AAG ATG TCC TCA ACT GAG GGC CCT CAC AAG 334 
He Leu He Thr He Leu Lya Met Ser Ser Thr Glu Gly Krg Him Lya 

100 105 110 

CCT TTC TCC ACA TGT ACC TCC CAC CTC ACT GCA CTC ACT CTC TAC TAT 382 
Ala Phe Ser Thr Cye Thr Ser Hie Leu Thr Ala Val Thr Leu Tyr Tyr 

115 120 125 

GGC ACC ATT ACC TTC ATT TAT GTG ATG CCC AAG TCC ACA TAC TCT ACA 430 
Gly Thr He Thr Phe He Tyr Val Met Pro Lya Ser Thr Tyr Ser Thr 
130 135 140 

CAC CAG AAC AAG GTG CTC TCT GTG TTT TAC ATG CTG GTG ATC CCA ATC 478 
Aap Gin Aan Lya Val Val Ser Val Phe Tyr Met Val Val He Pro Met 

145 150 155 



481 
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ii* Cys Asn Pro Lau Lau Tyr Sar Thr Lys Mat Sar Thr Gin v*l Cys 

1 5 10 15 

I la Gin Lau Val Ala Gly Sar Tyr Ha Gly Gly Pha Lau Asn Thr Cys 

20 25 30 

Lsu I la Hat Pha Tyr Pha Pha Sar Pha Lau pha Cys Gly Pro Asn lis 

35 40 45 

Val Asp His Pha Pha Cys Asp Pha Al* Pro X*s Xaa Glu Lau Sar Cys 

50 55 60 

S«r Asp Val Sar Val Ssr Val Val Val Mat Sar Pha Sar Al* Gly Sar 

65 70 75 80 

V*l Thr M«t lis Thr Val Pha lis lis Ala lis Sar Tyr Sar Tyr Zla 

05 90 95 

Lau I la Thr Xla Lau Lys Mat Sar Sar Thr Glu Gly Arg His Lys Al* 

100 105 110 

Pha Sar Thr Cys Thr Sar His Lau Thr Ala Val Thr Lau Tyr Tyr Gly 

115 120 125 

Thr I la Thr Pha I la Tyr Val Mat Pro Lys Sar Thr Tyr Sar Thr Asp 

130 135 140 

Gin Asn Lys v*l Val Sar Val Pha Tyr Mat Val Val Xla Pro Mat Lau 

145 ISO 155 160 

(2) INFORMATION FOR SEQ ID NO: 33: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 baaa pairs 
(8) TYPE: nuclaic acid 

(C) STRANDED NESS t sing la 

(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory apithaliun 

(B) STRAIN: Srpagua-Dawlay rat 

<F) TISSUE TYPE: olfactory apithaliun 

(vii) IMMEDIATE SOURCE: 
(B> CLONE: J19 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2,. 479 

(Xi) SEQUENCE DE SCRIPT I N: SEQ ID NO: 33: 

T ATC TCC CAC CCT CTG AAC TAC ACA GTT ATC ATG AAT CAC TAT TTT 46 
I la Cys His Pro Lau Lys Tyr Thr Val lis Mat Aan Hia Tyr Pha 
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15 10 15 

TGT CTC ATC CTC CTC CTC TTC TCT CTC TTC CTT ACC ATT CCA CAT OCG 94 
Cys Val Met Leu Leu Leu Phe Ser Val Phe Val Ser He Ala His Ala 

20 25 30 

TTC TTC CAC ATT TTA ATC CTC TTG ATA CTC ACT TTC ACC ACA AAA ACT 142 
Leu Phe His XI* Leu Met Val Leu XI* Leu Thr Phe Ser Thr Lys Thr 

35 40 45 

GAA ATC CCT CAC TTT TTC TCT CAC CTC CCT CAT ATC ATC AAA CTT ACC 190 
Glu lis Pro His Ph* Ph« Cys Glu Leu Ala Bis Ila II* Lys Leu Thr 

SO 55 60 

TCT TCC CAT AAT TTT ATC AAC TAT CTC CTC ATA TAC ACA CAC TCT CTC 238 
Cys Ser Asp Asn Ph* XI* A*n Tyr L*u Leu XI* Tyr Thr Glu S*r Val 

65 70 75 

TTA TTT TTT GOT CTT CAT ATT CTA CCC ATC ATT TTC TCT TAT ATT TAC 286 
Leu Ph* Ph* Cly Val His II* Val Gly XI* XI* Leu Ser Tyr II* Tyr 
80 85 90 95 

ACT CTA TCC TCA CTT TTA ACA ATC TCA TTA TTC CCA GCA ATC TAT AAA 334 
Thr Val Ser Ser Val Leu Arg Met Ser Leu L*u Cly Gly Met Tyr Lys 

100 105 110 

CCC TTT TCA ACA TCT CCA TCT CAT TTC TCC CTT CTC TCT GTT TTA TGG 382 
Ala Ph* Ser Thr Cys Cly Ser His L*u Ser Val Val Ser Val Leu Trp 

115 120 125 

CAC ACC TTT TCC GCT ACA CAT AAC CTC TCC ACT TAC TCA CTC TCC AAC 430 
His Arg Ph* Trp Cly Thr His Lys Leu Ser Thr Tyr • Leu Ser Lys 

130 135 140 

GAA GAC TCT ACT CCC TTC ACT CAT CTA CAC TCT CCT TAC TCA CAT CCT C 479 
Clu Asp Cys Ser Gly Ph* Ser Asp Val His Cys Gly Tyr Ser Asp Ala 

145 150 155 



(2) INFORMATION FOR SEQ XD NO: 34: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1S9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

lie Cys His Pro Leu Lys Tyr Thr Val lie Met Asn His Tyr Phe Cys 

15 10 15 

Val Met Leu Leu Leu Phe Ser Val Phe Val Ser He Ala His Ala Leu 

20 25 30 

Phe His He Leu Met Val Leu He Leu Thr Phe Ser Thr Lys Thr Glu 
35 40 45 

He Pro His Phe Phe Cys Clu Leu Ala His He He Lys Leu Thr Cys 

50 55 60 
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Ser Asp Asn Phe He Asn Tyr Leu Leu h # Tyr Thr lu 3« r v »l 

65 70 75 80 

Phe Phe Gly Val His He Val Cly u # n« ^ S er Tyr iu ^ thr 

85 90 95 

Val Ser Ser Val Leu Arg Met Ser Leu Leu OXy Gly *•* Tyr Lye Ala 

100 105 no 

Phe Ser Thr Cy* Oly Ser His Leu Ser Val Val *«r Val Leu Tro Hit 

115 120 125 

Arg Phe Trp Gly Thr His Lys Leu Ser Thr Tyr • Leu ser Lys clu 

130 135 140 

Asp Cys Ser Oly Phe Ser Asp Val His Cys Oly Tyr Ser Asp Aim 

145 150 155 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH i 481 bass pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNSSS I single 

(D) TOPOLOGY t UntiT 

(Li) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: HO 

(vi) ORIGINAL SOURCE 1 

(A) ORGANISM* rat olfactory epithelium 

(B> STRAIHi Srpegue-Davley rat 

(P) TISSUE TYPE i olfactory epithelium 

<vil) IMMEDIATE SOURCES 
(S) CLOWE I J20 

fix) FEATURE 1 

(A) HAMS/KEY t CDS 

(B) LOCATION 2..4S1 

(xi) SEQUENCE DESCRIPTIOMi SEQ ID NOi35i 

A ATC TOO TAC CCA CTO AGO TAC CTT CTC ATC ATG AGC TGG GTC GTG 46 
lis Cys Tyr Pro Leu Arg Tyr Leu Leu lis Met Ser Trp Vsl Val 

15 10 15 

TCC ACA CCA CTG TCC GTG CCA ATC TOO CTC ATA GGC TTT TGT GCC TCC 94 
Cys Thr Ala Leu Ser Val Ala lie Trp Vsl He Cly Phe Cys Ala Ser 

20 25 30 

GTT ATA CCT CTC TGC TTC AOG ATC CTC CCA CTC TGT GOT CCT TAC GTC 142 
val He Pro Leu Cys Phe Thr He Leu Pro Leu Cys Gly Pro Tyr Val 

35 40 45 

CTT GAT TAT CTT TTC TGC GAG CTG CCC ATC CTT CTO CAC CTG TTC TGC 190 
Val Asp Tyr Leu Phe Cys Clu Leu Pro He Leu Leu His Leu Phe Cys 

SO 55 60 

ACA GAT ACA TCT CTC CTC CAC NNN NNN NNN NKN NUN NUN NNN NNN NNN 238 
Thr Asp Thr Ser Leu Leu Clu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 75 
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NNJJ NKN NNN NNN NNN CCC .TTC CTC CTC ATT CTT CTC TCC TAG CTT CGC 2B6 

Xaa Xaa Xaa Xaa Xaa Pro Pha Lau Lau ll« val Lau Sar Tyr Lau Arg 

80 85 90 95 

ATC CTC CTG OCT CTC ATA AGA ATA CAC TCA CCT CAC CCC ACA AAA AAC 334 
Ila L«u Val Ala Val Il» Arg Ila Alp Sar Ala Glu Cly Arg Ly» Lya 

100 105 HO 

CCC TTT TCA ACT TCT CCT TCA CAC TTC CCT CTC CTC ACC ATC TAC TAT 382 
Ala Pha Sar Thr Cya Ala Sar Hia Lau Ala Val Val Thr I la Tyr Tyr 

115 120 125 

CCA ACA GCC CTC ATC AGG TAC TTC AGG CCC AAC TCC CTT TAT TCC CCT 430 
Cly Thr Cly Lau Ila Arg Tyr Lau Arg Pro Lya Sar Lau Tyr Sar Ala 
130 135 140 

CAC CCA CAC AGA CTC ATC TCT CTC TTC TAT CCA CTC ATT CCC CCT CCA 478 
Glu Gly Asp Arg Lau Ila Sar Val Pha Tyr Ala Val Ila Cly Pro Ala 

145 ISO 155 

481 



Lau 

160 



(2) INFORMATION FOR SfQ ID NOi36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acid* 

(B) TYPE i amino acid 
<D) TOPOLOGY t linaar 

<ii) MOLECULE TYPE: protain 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Ila Cya Tyr Pro Lau Arg Tyr Lau Lau Ila Mat Sar Trp Val Val Cya 

15 10 15 

Thr Ala Lau Sar Val Ala Ila Trp Val Ila Cly Pha Cya Ala Sar Val 

20 25 30 

Ila Pro Lau Cya Pha Thr Ila Lau Pro Lau Cya Gly Pro Tyr Val Val 
35 40 45 

Asp Tyr Lau Pha Cya Glu Lau Pro Ila Lau Lau Hia Lau Pha Cya Thr 

50 55 60 

Aap Thr Sar Lau Lau Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 75 80 

Xaa Xaa Xaa Xaa Pro Pha Lau Lau Ila Val Lau Sar Tyr Lau Arg Ila 

85 90 95 

Lau Val Ala Val Ila Arg Ila Aap Sar Ala Glu Gly Arg Lya Lya Ala 

100 105 110 

Pha Sar Thr Cy» Ala Sar Hi» Lau Ala v*l Val Thr Ila Tyr Tyr Cly 

115 120 125 

Thr Cly Lau Ila Arg Tyr Lau Arg Pro Lya Sar Lau Tyr Sar Ala Glu 

130 135 140 

Cly Aap Arg Lau Ila Sar Val Pha Tyr Ala Val Ila Gly Pro Ala Lau 

145 150 155 160 
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irhat is claimed is: 

1. An isolated nucleic acid, molecule encoding an odorant 
receptor. 

5 

2. An isolated DNA of claim 1, 

3. An isolated cDNA of claim 2. 

10 4. An isolated cDNA of claim 3 wherein the sequence of the 

cDNA has the nucleotide sequence shown in Figure 9. 

5. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 10. 

15 

6* An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 11. 

7. An isolated cDNA of claim 3 wherein the sequence of the 
20 cDNA has the nucleotide sequence shown in Figure 12. 

8. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 13. 

25 9. An isolated cDNA of claim 3 wherein the sequence of the 

cDKA has the nucleotide sequence shown in Figure 14. 

10. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 15. 

30 

11. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 16. 

12. An isolated cDNA of claim 3 wherein the sequence of the 
35 cDNA has the nucleotide sequenc shown in Figure 17. 
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13. An isolated cDNA of claim 3 wherein the sequence of the 
cONA has the nucleotide sequence shown in Figure 18. 

14. An isolated cDNA of claim 3 wherein the sequence of the 
5 cDNA has the nucleotide sequence shown in Figure 19. 

15. An isolated cDNA of claim 3 wherein the sequence of the 
cONA has the nucleotide sequence shown in Figure 20. 

10 16. An isolated cDNA of claim 3 wherein the sequence of the 

cDNA has the nucleotide sequence shown in Figure 21. 

* 

17. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 22. 

15 

18. An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 23* 

19. An isolated cDNA of claim 3 wherein the sequence of the 
20 cDMA has the nucleotide sequence shown in Figure 24. 

20. An isolated cONA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 25. 

25 21. An isolated cDNA of claim 3 wherein the sequence of the 

cDNA has the nucleotide sequence shown in Figure 26. 

22. An isolated cDNA of claim 3 wherein the sequence of the 
cONA has the nucleotide sequence shown in Figure 27. 

30 

23. An isolated cDNA of claim 3 wherein the sequence of the 
cDMA has the nucleotide sequence shown in Figure 28. 

24. An isolated cDNA of claim 3 wherein the sequence of the 
35 cDNA has the nucleotide sequence shown in Figure 29. 
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An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 30. 

An isolated cDNA of claim 3 wherein the sequence of the 
cDNA has the nucleotide sequence shown in Figure 31. 

An isolated cDNA of claim 3 encoding an insect odorant 
receptor. 

An isolated cDNA of claim 3 encoding a vertebrate 
odorant receptor. 

An isolated cDNA of claim 3 encoding a fish odorant 
receptor • 

An isolated cDNA of claim 3 encoding a mammalian 
odorant receptor. 

An isolated cDNA of claim 30 wherein the mammalian 
odorant receptor is a human odorant receptor. 

An isolated cDNA of claim 30 wherein the mammalian 
odorant receptor is a rat, dog or mouse odorant 
receptor. 

An expression vector comprising the cDNA of claim 3 and 
the sequence elements necessary for replication and 
expression in a suitable host. 

An expression vector comprising the cDNA of any of 
claims 4-19 and the sequence elements necessary for 
replication and expression in a suitable host. 
A purified protein encoding an odorant receptor. 
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36. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 9. 

37. A purified protein of claim 35 wherein the amino acid 
5 sequence is the sequence in Figure 10. 

38. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 11. 

10 39. A purified protein of claim 35 wherein the amino acid 

sequence is the sequence in Figure 12. 

40. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 13. 

15 

41. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 14 . 

42. A purified protein of claim 35 wherein the amino acid 
20 sequence is the sequence in Figure 15, 

43. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 16. 

25 44 • A purified protein of claim 35 wherein the amino acid 

sequence is the sequence in Figure 17. 

45. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 18. 

30 

46. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 19. 

47. A purified protein of claim 35 wherein the amino acid 
35 sequence is the sequence in Figure 20. 

SUBSTITUTE SHEET 



1 *■ A I A 



WO 92/17585 PC17US92/02741 



-89- 



48. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 21. 

49* A purified protein of claim 35 wherein the amino acid 
5 sequence is the sequence in Figure 22. 

50. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 23. 

10 51. A purified protein of claim 35 wherein the amino acid 

sequence is the sequence in Figure 24. 

52. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 25. 

15 

53. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 26. 

54. A purified protein of claim 35 wherein the amino acid 
20 sequence is the sequence in Figure 27. 

55. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 28. 

25 56. A purified protein of claim 35 wherein the amino acid 

sequence is the sequence in Figure 29. 

57. A purified protein of claim 35 wherein the amino acid 
sequence is the sequence in Figure 31. 

30 

58. A purified protein of claim 3 5 encoding an insect 
odorant receptor. 

59. A purified protein of claim 35 encoding a vertebrate 
35 odorant receptor. 
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60. A purified protein of claim 35 encoding a fish odorant 
receptor. 

61. A purified protein of claim 35 encoding a mammalian 
5 odorant receptor. 

62. A purified protein of claim 61 wherein the mammalian 
odorant receptor is a human odorant receptor. 

10 63. A purified protein of claim 61 wherein the mammalian 

odorant receptor is a rat, dog or mouse odorant 
receptor, 

64. A purified protein of claim 35 which has 7 
15 transmembrane regions and whose third cytoplasmic loop 

from the N-terminus is approximately 17 amino acid 
long. 

65. A method of transforming cells which comprises 
20 transfecting a host cell with a suitable expression 

vector of claim 33* 

66. A method of transforming cells which comprises 
transfecting a host cell with a suitable expression 

25 vector of claim 34. 

67. Cells transformed by the method of claim 65. 

68. Transformed cells of claim 67 wherein the cells are 
30 olfactory cells. 

69. Transformed cells of claim 67 wherein the cells are 
non-olfactory cells. 



35 
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A method of identifying a desired odorant ligand 
comprising contacting transformed non-olfactory cells 
of claim 69, expressing a known odorant receptor with 
a series of odorant ligands and determining which 
ligands bind to the receptors present on the non* 
olfactory cells. 

A method of identifying a desired odorant receptor 
comprising contacting a series of transformed non- 
olfactory cells of claim 69 with a known odorant ligand 
and determining which odorant receptor binds with the 
odorant ligand. 

A method of detecting an odor which comprises: 

a) identifying a odorant receptor which binds the 
desired odorant ligand by the method of claim 71 
and ; 

b) imbedding the receptor in a membrane such that 
when the odorant ligand binds with the receptor 
identified in a) above, a detectable signal is 
produced. 

A method of claim 72 wherein the desired odorant is a 
pheromone . 

A method of claim 72 wherein the desired odorant ligand 
is the vapors emanating from cocaine, marijuana, 
heroin, hashish, or angel dust. 

A method of claim 72 wherein the desired odorant ligand 
is the vapors emanating from gasoline, natural gas or 
alcohol. 
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76. A method of claim 72 wherein the desired odorant ligand 
is the vapors emanating from decayed human flesh* 

77. A method of claim 72 wherein the desired odorant ligand 
is the vapors emanating from explosives, plastic 
explosives, firearms, or gun powder. 

78. A method of claim 72 wherein the desired odorant ligand 
is toxic fumes, noxious fumes or dangerous fumes. 

79. A method of claim 72 wherein the membrane is a cell 
membrane . 



80. A method of claim 72 wherein the membrane is an 
15 olfactory cell membrane. 

81. A method of claim 72 wherein the membrane is a 
synthetic membrane. 

20 82. A method of claim 72 wherein the detectable signal is 

a color change, phospor escence , or radioactivity. 

83. A method of quantifying the amount of an odorant ligand 
present in a sample which comprises the method of claim 

25 72 wherein the detectable signal is quantified. 

84. A method of developing fragrances which comprises 
identifying a desired odorant receptor by the method of 
claim 71 then contacting non-olfactory cells, which 

30 have been transfected with an expression vector 

containing the cDNA of the desired odorant receptor 
such that the receptor is expressed upon the surface of 
the non-olfactory cell, with a series of compounds to 
determine which ones bind with the receptor. 

35 
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85. A method of identifying an odorant fingerprint which 
comprises contacting a series of cells, which have been 
transformed such that each express a known odorant 
receptor, with a desired sample and determining the 

5 type and quantity of the odorant ligands present in the 

sample. 

86. A method of identifying odorant ligands which inhibit 
the activity of a desired odorant receptor which 

10 comprises contacting the desired odorant receptor with 

a series of compounds and determining which compounds 
inhibit the odorant ligand - odorant receptor 
interaction. 

15 87. A method of identifying appetite suppressant compounds 

which comprises identifying odorant ligands by the 
method of claim 86 wherein the desired odorant receptor 
is that which is associated with the perception of 
food. 

20 

88. A method of controlling appetite in a subject which 
comprises contacting the olfactory epithelium of the 
subject with the odorant ligands identified by the 
method of claim 87. 

25 

89. A nasal spray, to control appetite comprising the 
compounds identified by the method of claim 87 in a 
suitable carrier. 

30 90. A method of trapping odors which comprises contacting 

a membrane which contains multiples of the desired 
odorant receptor, with a sample such that the desired 
odorant ligand is absorbed by the binding of the 
odorant ligand to the odorant receptor. 



35 
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91. An odor trap employing the method of claim 90. 

92. A method of controlling pest populations which 
comprises identifying odorant ligands by the method of 

5 claim 70 which are alarm odorant ligands and spraying 

the desired area with the identified odorant ligands. 

93. A method of controlling a pest population which 
comprises identifying odorant ligands by the method of 

10 claim 70 which interfere with the interaction between 

the odorant ligands and the odorant receptors which are 
associated with fertility. 

94. A method of claim 92 or 93 wherein the pest population 
15 is a population of insects. 

95. A method of claim 92 or 93 wherein the pest population 
is a population of rodents. 

20 96. A method of claim 95 wherein the population of rodents 

is a population of mice or rats. 

97. A method of promoting fertility which comprises 
employing the method of claim 70 to identify odorant 

25 ligands which interact with the odorant receptors 

associated with fertility and administering the 
identified odorant ligands to a subject. 

98. A method of inhibiting fertility which comprises 
30 employing the method of claim 70 to identifying odorant 

ligands which inhibit the interaction between the 
odorant ligands and the odorant receptors associated 
with fertility and administering the identified odorant 
ligands to a subject. 
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Pigur 4A 
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expression vectors containing such nucleic acid. Also provided is a purified protein encoding an odorant receptor, with the afore- 
mentioned expression vectors and the resulting transformed cell. The invention also provides methods of identifying odorant li- 
gands and of identifying odorant receptors. The invention further provides methods of developing fragrances, of identifying 
appetite suppressant compounds, of controlling appetite, of controlling pest populations, of promoting and inhibiting fertility, 
and of detecting odors. 
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ODORAHT RECEP TORS AMD DflBB THEREOF 

Background of the Invention 

5 This application is a continuation-in-part of U.S. Serial 
No. 68 1,880, filed April 5, 1991, the contents of which are 
hereby incorporated by reference. 

Throughout this application, various publications are 
10 referenced by Arabic numerals within parentheses. Full 
citations for these publications may be found at the end of 
the specification immediately preceding the claims. The 
disclosures of these publications in their entireties are 
hereby incorporated by reference into this application in 
15 order to more fully describe the state of the art as known 
to those skilled therein as of the date of the invention 
described and claimed herein. 

In vertebrate sensory systems, peripheral neurons respond to 

20 environmental stimuli and transmit these signals to higher 
sensory centers in the brain where they are processed to 
allow the discrimination of complex sensory information. 
The delineation of the peripheral mechanisms by which 
environmental stimuli are transduced into neural information 

25 can provide insight into the logic underlying sensory 
processing. Our understanding of color vision, for example, 
emerged only after the observation that the discrimination 
of hue results from the blending of information from only 
three classes of photoreceptors (1, 2, 3, 4). The basic 

30 logic underlying olfactory sensory perception, however, has 
remained elusive. Mammals possess an olfactory system of 
enormous discriminatory power (5, 6} . Humans, for example, 
are thought to be capable of distinguishing among thousands 
of distinct odors. The specificity of odor recognition is 

35 emphasized by the observation that subtle alterations in the 
molecular structure of an odorant can lead to profound 
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changes in perceived odor. 

The detection of chemically distinct odorant presumably 
results from the association of odorous ligands with 
5 specific receptors on olfactory neurons which reside in a 
specialized epithelium in the nose. Since these receptors 
have not been identified, it has been difficult to determine 
how odor discrimination might be achieved. It is possible 
that olfaction, by analogy with color vision, involves only 

10 a few odor receptors, each capable of interaction with 
multiple odorant molecules. Alternatively, the sense of 
smell may involve a large number of distinct receptors each 
capable of associating with one or a small number of 
odorant. In either case, the brain must distinguish which 

15 receptors or which neurons have been activated to allow the 
discrimination between different odorant stimuli. Insight 
into the mechanisms underlying olfactory perception is 
likeLY to depend upon the isolation of the odorant 
receptors, and the characterization of their diversity, 

20 specificity, and patterns of expression. 

The primary events in odor detection occur in a specialized 
olfactory neuroepithelium located in the posterior recesses 
of the nasal cavity. Three cell types dominate this 

2 5 epithelium (Figure 1A) : the olfactory sensory neuron, the 
sustentacular or supporting cell, and the basal cell which 
is a stem cell that generates olfactory neurons throughout 
life (7, 8). The olfactory sensory neuron is bipolar: a 
dendritic process extends to the mucosal surface where it 

30 gives rise to a number of specialized cilia which provide an 
extensive, receptive surface for the interaction of odors 
with olfactory sensory neurons. The olfactory neuron also 
gives rise to an axon which projects to the olfactory bulb 
of the brain, the first relay in the olfactory system. The 

35 axons of the olfactory bulb neurons, in turn, project to 
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subcortical and cortical regions where higher level 
processing of olfactory information allows the 
discrimination of odors by the brain. 

5 The initial events in odor discrimination are thought to 
involve the association of odors with specific receptors on 
the cilia of olfactory neurons. Selective removal of the 
cilia results in the loss of olfactory response (9) . 
Moreover, in fish, whose olfactory system senses amino acids 

10 as odors, the specific binding of amino acids to isolated 
cilia has been demonstrated (10, 11) . The cilia are also 
the site of olfactory signal transduction. Exposure of 
isolated cilia from rat olfactory epithelium to numerous 
odorant leads to the rapid stimulation of adenylyl cyclase 

15 and elevations in cyclic AMP (an elevation in IP3 in 
response to one odorant has also been observed) {12, 13, 14, 
15) . The activation of adenylyl cyclase is dependent on the 
presence of GTP and is therefore likely to be mediated by 
receptor -coupled GTP binding proteins (G-proteins) (16) . 

20 Elevations in cyclic AMP, in turn, are thought to elicit 
depolarization of olfactory neurons by direct activation of 
a cyclic nucleotide-gated, cation permeable channel (17, 
18), This channel is opened upon binding of cyclic 
nucleotides to its cytoplasmic domain, and can therefore 

25 transduce changes in intracellular levels of cyclic AMP into 
alterations in the membrane potential. 

These observations suggest a pathway for olfactory signal 
transduction (Figure IB) in which the binding of odors to 

30 specific surface receptors activates specific G-proteins. 
The G-proteins then initiate a cascade of intracellular 
signalling events leading to the generation of an action 
potential which is propagated along the olfactory sensory 
axon to the brain. A number of neurotransmitter and hormone 

35 receptors which transduce intracellular signals by 
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activation of specific G-proteins have been identified. 
Gene cloning has demonstrated that each of these receptors 
is a member of a large super family of surface receptors 
which traverse the membrane seven times (19 ( 20). The 
5 pathway of olfactory signal transduction (Figure IB) 
predicts that the odor ant receptors might also be members of 
this superfamily of receptor proteins. The detection of 
odors in the periphery is therefore likely to involve 
signalling mechanisms shared by other hormone or 

10 neurotransmitter systems, but the vast discriminatory power 
of the olfactory system will require higher order neural 
processing to permit the perception of individual odors. 
This invention address the problem of olfactory perception 
at a molecular level. Eighteen different members of an 

15 extremely large multigene family have been cloned and 
characterized which encodes seven transmembrane domain 
proteins whose expression is restricted to the olfactory 
epithelium. The members of this novel gene family encode 
the individual odorant receptors. 

20 
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flUMMARY OP THE IMVKWTION 

The invention provides an isolated nucleic acid, e.g. a DNA 
and cDNA molecule, encoding an odorant receptor. The 
5 invention further provides expression vectors containing 
such nucleic acid. Also provided by the invention is a 
purified protein encoding an odorant receptor. The 
invention further provides a method of transforming cells 
which comprises transfecting a suitable host cell with a 
10 suitable expression vector containing the nucleic acid 
encoding the odorant receptor. 

The invention also provides methods of identifying odorant 
ligands and of identifying odorant receptors. The invention 

15 further provides methods of developing fragrances, of 
identifying appetite suppressant compounds, of controlling 
appetite. The invention also provides methods of 
controlling insect and other animal populations. The 
invention additionally provides a method of detecting odors 

2 0 such as the vapors emanating from Cocaine, Marijuana, 
Heroin, Hashish, Angel Dust, gasoline, decayed human flesh, 
alcohol, gun powder explosives, plastic explosives, 
firearms, poisonous or harmful smoke, or natural gas. 
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Description of the riaurmm 

Figure 1. The Olfactory Neuroepithelium and a Pathway for 
Olfactory Signal Transduction. A. The Olfactory 
5 Neuroepithelium. The initial event in odor perception 
occurs in the nasal cavity in a specialized neuroepithelium 
which is diagrammed here. Odors are believed to interact 
with specific receptors on the cilia of olfactory sensory 
neurons. The signal generated by these initial binding 

10 events are propagated by olfactory neuron axons to the 
olfactory bulb. B . A Pathway of Olfactory Signal 
Transduction. In this scheme, the binding of an odorant 
molecule to an odor-specific transmembrane receptor leads to 
the interaction of the receptor with a GTP-binding protein 

15 ( G s(oif))- This interaction, in turn, leads to the release 
of the GTP-coupled a-subunit of the G-protein, which then 
stimulates adenylyl cyclase to produce elevated levels of 
cAMP. The increase in cAMP opens nucleotide-gated cation 
channels, thus causing an alteration in membrane potential. 

20 

Figure 2. A PCR Amplification Product Containing Multiple 
Species of DNA . cDNA prepared from olfactory epithelium RNA 
was subjected to PCR amplification with a series of 
different primer oligonucleotides and the DNA products of 
25 appropriate size were isolated, further amplified by PCR, 
and size fractionated on agarose gels (A) (For details, see 
text) . Each of these semipurif ied PCR products was digested 
with the restriction enzyme, Hinf I, and analyzed by agarose 
gel electrophoresis. Lanes marked "M" contain size markers 

30 of 23.1, 9.4, 5.6, 4.4, 2.3, 2.0, 1.35, 1.08, 0.87, 0.60, 
0.31, 0.28, 0.23, 0.19, 0.12 and 0.07kb. (B) . Twenty- 
two of the 64 PCR products that were isolated and digested 
with Hinf I are shown here. Digestion of one of these, PCR 
13, yielded a large number of fragments whose sizes summed 

35 to a value much greater than that of the undigested PCR 13 
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DNA, indicating that PGR 13 might contain multiple species 
of DNA which are representatives of a multigene family. 

Figure 3. Northern Blot Analysis with a Mixture of Twenty 
5 Probes . One ng of polyA+ RNA isolated from rat olfactory 
epithelium, brain, or spleen was si2e-f ractionated in 
formaldehyde agarose! blotted onto a nylon membrane, and 
hybridized with a 32 P- labeled mixture of segments of 20 cDNA 
clones. The DNA segments were obtained by PCR using primers 
10 homologous to transmembrane domains 2 and 7* 

Figure 4. The Protein Seque nces Encoded bv Ten Divergent 
cDNA Clones . Ten divergent cDNA clones were subjected to 
DNA sequence analyses and the protein sequence encoded by 

15 each was determined* Amino acid residues which are 
conserved in 60% or more of the proteins are shaded. The 
presence of seven hydrophobic domains (I-VII) , as well as 
short conserved motifs shared with other members of the 
super family, demonstrate that these proteins belong to the 

20 seven transmembrane domain protein superfamily. Motifs 
conserved among members of the superfamily and the family of 
olfactory proteins include the GN in TM1 (transmembrane 
domain 1), the central W of TM4 , the Y near the C-terminal 
end of TM5, and the NP in TM7. In addition, the DRY motif 

25 C-terminal to TM3 is common to many members of the G- 
protein-coupled superfamily. However, all of the proteins 
shown here share sequence motifs not found in other members 
of this superfamily and are clearly members of a novel 
family of proteins. 

30 

Figure 5. Positions of Greatest Variability in the 
Ollactory Protein Family . In this diagram, the protein 
encoded by cDNA clone 115 is shown traversing the plasma 
membrane seven times with its N-terminus located 
35 extracellularly , and its C-terminus intracellularly. The 
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vertical cylinders delineate the seven putative a-helices 
spanning the membrane. Positions at which 60% or more of 
the 10 clones shown in Figure 4 share the same residue as 
115 are shown as white balls. More variable residues are 
5 shown as black balls. The high degree of variability 
encountered in transmembrane domains III, IV # and V is 
evident in this schematic. 

Figure 6. The Presence of Subfamilies in a Divergent 
10 Multiaene Family . Partial nucleotide sequences and deduced 
protein sequences were obtained for 18 different cDNA 
clones. Transmembrane domain V along with the flanking loop 
sequences, including the entire cytoplasmic loop between 
transmembrane domains V and VI, are shown here for each 
15 protein. Amino acid residues found in 60% or more of the 
clones in a given position are shaded (A) . This region of 
the olfactory proteins (particularly transmembrane domain V) 
appears to be highly variable (see Figure 4) . These 
proteins, however, can be grouped into subfamilies (B,C,D) 
20 in which the individual subfamily members share considerable 
homology in this divergent region of the protein. 

Figure 7. Southern Blot Analyses with Non-crosshybridizina 
Fragments of Divergent cDNAs . Five ^g of rat liver DNA was 

25 digested with Eco RI (A) or Hind III (B) , electrophoresed in 
0.75% agarose, blotted onto a nylon membrane, and hybridized 
to the 32 P- labeled probes indicated. The probes used were 
PCR-generated fragments of: 1, clone F9 (identical to F12 
in Figure 4); 2, F5; 3, F6; 4, 13; 5, 17; 6, 114; or 7, 115. 

30 The lane labeled "1-7" was hybridized to a mixture of the 
seven probes. The probes used showed either no 
crosshybridization or only trace crosshybridization with one 
another. The size markers on the left correspond to the 
four blots on the left (1-4) whereas the marker positions 

3 5 noted on the right correspond to the four blots on the right 
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(5-7, "1-7"). 



Figure 8, Northern Blot Analysis with a Mix of Seven 
Divergent Clones . One nq of polyA+ RNA from each of the 
tissues shown was size-fractionated, blotted onto a nylon 
membrane, and hybridized with a 32 P-labeled mixture of 
segments of seven divergent cDNA clones (see Legend to 
Figure 7 ) . 



10 



Figure 9. The amino acid and nucleic acid sequence of clone 
F3. 



15 



Figure 10. The amino acid and nucleic acid sequence of 
clone F5. 

Figure 11. The amino acid and nucleic acid sequence of 
clone F6. 



20 



Figure 12. The amino acid and nucleic acid sequence of 
clone F12. 



Figure 13. The amino acid and nucleic acid sequence of 
clone 13. 

25 Figure 14. The amino acid and nucleic acid sequence of 
clone 17. 

Figure 15. The amino acid and nucleic acid sequence of 
clone 18. 



30 



Figure 16. The amino acid and nucleic acid sequence of 
clone 19. 



Figure 17. The amino acid and nucleic acid sequence of 
35 clone 114. 
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Figure 18. The amino acid and nucleic acid sequence of 
clone 115. 

Figure 19. The amino acid and nucleic acid sequence of 
5 human clone H5. 

Figure 20. The amino acid and nucleic acid sequence of 
clone Jl, where the reading frame starts at nucleotide 
position 2. 

10 

Figure 21. The amino acid and nucleic acid sequence of 
clone J2. 

Figure 22. The amino acid and nucleic acid sequence of 
15 clone J4, where the reading frame starts at nucleotide 
position 2. 

Figure 23. The amino acid and nucleic acid sequence of 
clone J7 f where the reading frame starts at nucleotide 
20 position 2. 

Figure 24. The amino acid and nucleic acid sequence of 
clone J8 1 where the reading frame starts at nucleotide 
positon 2. 

25 

Figure 25. The amino acid and nucleic acid sequence of 
clone Jll. 

Figure 26. The amino acid and nucleic acid sequence of 
30 clone J14, where the reading frame starts at nucleotide 
position 2. 

Figure 27. The amino acid and nucleic acid sequence of 
clone J15, where the reading frame starts at nucleotide 
35 psition 2. 
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Figure 28. The amino acid and nucleic acid sequence of 
clone J16, where the reading frame starts at nucleotide 
position 2. 

5 Figure 29. The amino acid and nucleic acid sequence of 
clone J17, where the reading frame starts at nucleotide 
position 2. 

Figure 30. The amino acid and nucleic acid sequence of 
10 clone J19, where the reading frame starts at nucleotide 
position 2. 

Figure 31. The amino acid and nucleic acid sequence of 
clone J20, where the reading frame starts at nucleotide 
15 position 2, 

Figure 32. SOUTHERN BLOT: Five micrograms of DNA isolated 
from 1. Human placenta, 2* NCI-H-1011 neuroblastoma cells, 
or 3. CHP 134 neuroblastoma cells were treated with the 

20 restriction enzyme A. Eco RI, B. Hind III, C. Bam HI, or D. 
Pst I f and then electrophoresed on an agarose gel and 
blotted onto a nylon membrane. The blotted DNA was 
hybridized to the 32P-labeled H3/H5 sequence. An 
autoradiograph of the hybridized blot is shown with the 

25 sizes of co-elect rophoresed size markers noted in kilobases. 
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D*tail«d Description of the Invtnt^on 

The invention provides an isolated nucleic acid, e.g. a DNA 
or cDNA molecule, encoding an odorant receptor. Such a 
5 receptor is a receptor which binds an odorant ligand and 
include but not limited to pheromone receptors. An odorant 
ligand may include, but is not limited to, molecules which 
interact with the olfactory sensory neuron, molecules which 
interact with the olfactory cilia, pheromones, and molecules 
10 which interact with structures within the vomeronasal organ. 

The invention specifically provides the isolated cDNAs 
encoding odorant receptors the sequences of which are shown 
in Figures 9-31. The nucleic acid is most typically a cDNA 
15 and encodes an insect, a vertebrate, a fish or a mammalian 
odorant receptor. The mammalian odorant receptor is 
preferably a human, rat, mouse or dog receptor. In an 
embodiment, human odorant receptor cDNA sequence and the 
correspondent protein is isolated (Figure 19) . 

20 

In another embodiment, phermone receptors are isolated and 
shown as clones Jl, J2, J4, J7, J8, Jll, J14, J15, J16, J17, 
J19 and J20 (Figures 20-31) . 

25 The invention further provides expression vectors containing 
cDNA which encodes odorant receptors. Such expression 
vectors are well known in the art and include in addition to 
the nucleic acid the elements necessary for replication and 
expression in a suitable hosts. Suitable hosts are well 

30 known in the art and include without limitation bacterial 
hosts such as £. coli, animal hosts such as CHO cells, 
insect cells, yeast cells and like. 

The invention also provides purified proteins encoding 
3 5 odorant receptors. Such proteins may be prepared by 
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expression of the f orementioned expression vectors in 
suitable host cells and recovery and purification of the 
receptors using methods well known in the art. Examples of 
such proteins include those having the amino acid sequences 
5 shown in figures 9-31. 

The purified protein typically encodes an insect, 
vertebrate, fish or mammalian odorant receptor. The 
mammalian odorant receptor may be a human, rat, mouse or 
10 dog. 

In one embodiment the invention provides a novel purified 
protein which belong to a class of proteins which have 7 
transmembrane regions and a third cytoplasmic loop from the 
15 N-terminus which is approximately 17 amino acid long and to 
nucleic acid molecules encoding such proteins. 

The invention provides methods of transforming cells which 
comprises transfecting a suitable host cell with a suitable 

20 expression vector containing nucleic acid encoding of the 
odorant receptor. Techniques for carrying out such 
transformations on cells are well Known to those skilled in 
the art. (41,42) Additionally, the resulting transformed 
cells are also provided by the invention. These transformed 

25 cells may be either olfactory cells or non-olfactory cells. 
One advantage of using transformed non-olfactory cells is 
that the desired odorant receptor will be the only odorant 
receptor expressed on the cell's surface. 

3 0 In order to obtain cell lines that express a single receptor 
type, standard procedures may be used to clone individual 
cDNAs or genes into expression vectors and then transfect 
the cloned sequences into mammalian cell lines. This 
approach has been used with sequences encoding some other 

35 members of the seven transmembrane domain superfamily 
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including the SHTlc serotonin receptor, (43) The cited work 
illustrates how members of this superfamily transferred into 
cell lines may generate immortal cell lines that express 
high levels of the transfected receptor on the cell surface 
5 where it will bind ligand and that such abnormally expressed 
receptor molecules can transduce signals upon binding to 
ligand. 



The invention also provides a method of identifying a 
10 desired odorant ligand which comprises contacting 
transformed non-olfactory cells expressing a known odorant 
receptor with a series of odorant ligands to determining 
which ligands bind to the receptors present on the non- 
olfactory cells. 

15 

Additionally, the invention provides a method of identifying 
a desired odorant receptor comprising contacting a series of 
transformed non-olfactory cells with a known odorant ligand 
and determining which odorant receptor binds with the 
20 odorant ligand. 

The invention provides a method of detecting an odor which 
comprises: a) identifying a odorant receptor which binds 
the desired odorant ligand and; b) imbedding the receptor 
25 in a membrane such that when the odorant ligand binds to the 
receptor so identified a detectable signal is produced. In 
one embodiment of the invention the membrane used in this 
method is cellular, including a membrane of an olfactory 
cell or a synthetic membrane. 

30 

The ligand tested for may be the vapors emanating from 
Cocaine, Marijuana, Heroin, Hashish, Angel Dust, gasoline, 
decayed human flesh, alcohol, gun powder explosives, plastic 
explosives or firearms. In another embodiment the ligand 
35 tested for may be natural gas, a pheromone, toxic fumes, 
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noxious fumes or dangerous fumes. 

In one embodiment of the invention the detectable signal is 
a lightbulb lighting up, a buzzer buzzing, a bell ringing, 
5 a color change, phosphorescence, or radioactivity. 

The invention further provides a method of quantifying the 
amount of an odorant ligand present in a sample which 
comprises utilizing the above-mentioned method for odor 
10 detection and then quantifying the amount of signal 
produced . 

The invention further provides a method of developing 
fragrances which comprises identifying a desired odorant 

15 receptor by the above method, then contacting non-olfactory 
cells, which have been transfected with an expression vector 
containing nucleic acid encoding the desired odorant 
receptor such that the receptor is expressed upon the 
surface of the non-olfactory cell, with a series of 

20 compounds to determine which compound or compounds bind the 
receptor . 

The invention provides to a method of identifying an 
"odorant fingerprint" which comprises contacting a series of 
25 cells, which have been transformed such that each express a 
Known odorant receptor, with a desired sample and 
determining the type and quantity of the odorant ligands 
present in the sample. 

30 The invention provides a method of identifying odorant 
ligands which inhibit the activity of a desired odorant 
receptor which comprises contacting the desired odorant 
receptor with a series of compounds and determining which 
compounds inhibit the odorant ligand - odorant receptor 

35 interaction. 
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The invention also provides for a method of identifying 
appetite suppressant compounds which comprises identifying 
odor ant ligands by the method mentioned in the preceding 
paragraph wherein the desired odorant receptor is that which 
5 is associated with the perception of food. Additionally, 
the invention provides a method of controlling appetite in 
a subject which comprises contacting the olfactory 
epithelium of the subject with these odorant ligands. 
Further the invention provides a nasal spray, to control 
10 appetite comprising the compounds identified by the above 
method in a suitable carrier. 



The invention provides a method of trapping odors which 
comprises contacting a membrane which contains multiples of 
15 the desired odorant receptor, with a sample such that the 
desired odorant ligand is absorbed by the binding of the 
odorant ligand to the odorant receptor. The invention also 
provides an odor trap employing this method. 



20 The invention also provides a method of controlling pest 
populations which comprises identifying odorant ligands by 
the method mentioned above which are alarm odorant ligands 
and spraying the desired area with the identified odorant 
ligands. Additionally, provided by the invention is a 

2 5 method of controlling a pest population which comprises 

identifying odorant ligands by the above mentioned method, 
which interfere with the interaction between the odorant 
ligands and the odorant receptors which are associated with 
fertility. In one embodiment the pest population is a 
30 population of insects or rodents, including mice and rats. 

The invention also provides a method of promoting fertility 
which comprises identifying odorant ligands which interact 
with the odorant receptors associated with fertility by the 

3 5 above mentioned method. Further, the invention provides a 
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method of inhibiting fertility which comprises employing the 
above mentioned method to identifying odorant ligands which 
inhibit the interaction between the odorant ligands and the 
odorant receptors associated with fertility, 

5 

This invention is illustrated in the Experimental Detail 
section which follow. These sections are set forth to aid 
in an understanding of the invention but are not intended 
to, and should not be construed to, limit in any way the 
10 invention as set forth in the claims which follow 
thereafter. 



15 



WCTHQPg 



Polymerase Chain Reaction 



20 RNA was prepared from the olfactory epithelia of Sprague 
Dawley rats according to Chirgwin et al. (40) or using 
RNAzol B (Cinna/Biotecx) and then treated with DNase I (0.1 
unit/jig RNA) (Promega) . In order to obtain cDNA, this RNA 
was incubated at 0.1 ng/nl with 5 /*M random hexamers 

25 (Pharmacia) 1 mM each of dATP, dCTP, dGTP, TTP, and 2 
units/Ml RNase inhibitor (Promega) in 10 mM TrisCl (pK 8.3), 
50 mM KC1, 2.5 mM MgCl 2 , and 0.001% gelatin for 10 min. at 
22 °C, and then for a further 45 min. at 37 D c following the 
addition of 20 u./m1 of Moloney murine leukemia virus 

30 reverse transcriptase (BRL) . After heating at 95°c for 3 
min., cDNA prepared from 0.2 jig of RNA was used in each of 
a series of polymerase chain reactions (PCR) containing 10 
mM TrisCl (pH 8.3) , 50 mM KC1, 1.5 mM MgCl 2 , 0.001% gelatin, 
200 each of dATP, dCTP, dGTP, and TTP, 2.5 u. Tag 

3 5 polymerase (Perkin Elmer Cetus) , and 2 ^M of each PCR 
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primer. PCR reactions were performed according to the 
following schedule: 96°C for 45 sec, 55°C for 4 Bin. (or 
45°C for 2 min.)/ 72°C for 3 min. with 6 sec. extension per 
cycle for 48 cycles. The primers used for PCR were a series 
5 of degenerate oligonucleotides made according to the amino 
acid sequences found in transmembrane domain 2 and 7 of a 
variety of different members of the 7 transmembrane domain 
protein superfamily (19). The regions used correspond to 
amino acids number 60-70 and 286-295 of clone 115 (Figure 
10 4). Each of five different 5* primers were used in PCR 
reactions with each of six different 3* primers. The 5* 
primers had the sequences: 



C AC AC CT 
15 Al # AATTGGATICTIGTIAATCTIGCIGTIGCIGCIGA; 

C C CA A C C 
A2, AATTATTTTCTIGTIAATCTIGCITTIGCIGA; 

20 CCA CC A C 

A3 , AATTTITTTATIATITCICTIGCITGIGCIGA; 

AT C T ACT C 

A4, CGITTICTIATGTGTAACCTITGCTTTGCIGA; 

25 

C CT TG 

A5 , ACIGTITATATIACI CATCTIACIATIGCIGA . 

The 3 1 primers were: 

30 

TTA T CAG CCA 

Bl , CTG I CGGTTCATI AA I A CATAI ATI ATI GGGTT ; 

TG GA G G A A 

35 B2, GATCGTTIAGACAACAATAIATIATIGGGTT ; 

AG G A 

B3, TCI ATGTTAAAIGTIGTATAIATI ATI GGGTT ; 

40 T G G A A 

B4, GCCTTIGTAAAI ATIGCATAIAGGAAIGGGTT ; 

G AGA G G G A 

B5, AAATCIGGGCTICGICAATAIATCAIIGGGTT; 

45 

CT CT G G G G A 
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B6, GAIGAI CCI ACAAAAAAATAI ATAAAI GGGTT . 

An aliquot of each PCR reaction was analyzed by agarose gel 
5 electrophoresis and bands of interest were amplified further 
by performing PCR reactions on pipet tip (approx. 1 pi) 
plugs of the agarose gels containing those DNAs. Aliquots 
of these semi-purified PCR products were digested with the 
restriction enzymes Hae III or Hinf I and the digestion 
10 products were compared with the undigested DNAs on agarose 
gels. 

Isolation and Analysis of cDNA Clones 

15 CDNA libraries were prepared according to standard 
procedures (41, 42) in the cloning vector, A, ZAP II 
(Stratagene) using poly A + RNA prepared from Sprague Dawley 
rat epithelia (see above) or from an enriched population of 
olfactory neurons which had been obtained by a 'panning* 

20 procedure, using an antibody against the H blood group 
antigen (Chembiomed) found on a large percentage of rat 
olfactory neurons. In initial library screens, 8.5 X 10 s 
independent clones from the olfactory neuron library and 1.8 
X 10 6 clones from the olfactory epithelium library were 

25 screened (41) with a 32 P-labeled probe (prime-it, 
Stratagene) consisting of a pool of gel-isolated PCR 
products obtained using primers A4 and B6 (see above) in PCR 
reactions using as template, olfactory epithelium cDNA, rat 
liver DNA, or DNA prepared from the two cDNA libraries. In 

30 later library screens, a mixture of PCR products obtained 
from 20 cDNA clones with the A4 and B6 primers was used as 
probe ('Pi 1 probe). In initial screens, phage clones were 
analyzed by PCR using primers A4 and B6 and those which 
showed the appropriate size species were purified. In later 

35 screens, all position clones were purified, but only those 
that could be amplified with the B6 primer and a primer 
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specific for vector sequence were analyzed further. To 
obtain plasmids from the isolated phage clones, phagemid 
rescue was performed according to the instructions of the 
manufacturer of AZAP II (Stratagene) . DNA sequence analysis 
5 was performed on plasmid DNAs using the Seguenase system 
(USB) , initially with the A4 and B6 primers and later with 
oligonucleotide primers made according to sequences already 
obtained. 

10 Northern and Southern Blot Analyses 

For Northern blots, poly A + RNAs from various tissues were 
prepared as described above or purchased from Clontech. One 
jig of each RNA was size fractionated on formaldehyde agarose 

15 gels and blotted onto nylon membranes (41, 42)* For 
Southern blots, genomic DNA prepared from Sprague Dawley rat 
liver was digested with the restriction enzymes Eco RI or 
Hind III, size fractionated on agarose gels and blotted onto 
nylon membranes (41, 42) . The membranes were dried at 80°C, 

20 and then prehybridized in 0.5 N sodium phosphate buffer (pH 
7.3) containing 1% bovine serum albumin and 4% sodium 
dodecyl sulfate. Hybridization was carried out in the same 
buffer at 65°-70°C for 14-20 hrs. with DNAs labeled with 
32 P. For the first Northern blot shown, the 'PI • probe (see 

25 above under cDNA clone isolation) was used* For the second 
Northern blot shown, a mix of PCR fragments from seven 
divergent cDNA clones was used. For Southern blots, the 
region indicated in clone 115 by amino acids 118 through 251 
was amplified from a series of divergent cDNA clones using 

3 0 PCR. The primers used for these reactions had the 
sequences: 

PI, ATGGCITATGATCGITATGTIGC, and 
35 P4, AAIAGIGAIACIATIGAIAGATGIGAICC 
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These ONAs (or a DNA encompassing transmembrane domains 2 
through 7 for clone F6) were labeled and tested for 
crosshybridization at 70 °C. Those DNAs which did not show 
appreciable crosshybridization were hybridized individually, 
5 or as a pool to Southern blots at 70° C. 

Rat Sequences used to obtain similar sequences ex pressed in 
Humans 



10 There are genes similar to the rat genes discussed above 
present in humans, these genes may be readily isolated by 
screening human gene libraries with the cloned rate 
sequences or by performing PCR experiments on human genomic 
DNA with primers homologous to the rat sequences. First, 

15 PCR experiments were performed with genomic DNA from rat, 
human, mouse, and several other species. When primers 
homologous to transmembrane domains 2 and 6 (the A4/B6 
primer set used to isolate the original rat sequences) were 
used, DNA of the appropriate size was amplified from rat, 

20 human and mouse DNAs. When these primary PCR reactions were 
subsequently diluted and subjected to PCR using primers to 
internal sequences (PI and P4 primers) , smaller DNA species 
were amplified whose size was that seen when the same 
primers were used in PCR reactions with the cloned rat 

25 cDNAs. Similarly, when the secondary PCR was performed with 
one outer primer together with one inner primer (ie. A4/P4 
or P1/B6) , amplified DNAs were obtained whose sizes were 
also consistent with the amplification of genes similar in 
sequence and organization to the cloned rat cDNAs. Second, 

30 a mix of segments from 20 of the rat cDNAs ('Pi" probe) was 
used to screen libraries constructed from human genomic 
DNAs. Hybridization under high or low stringency conditions 
reveals the presence of a large number of cloned human DNA 
segments that are homologous to the rat sequences. 

3 5 Finally, RNA from a human olfactory tumor (neuroesthesioma, 
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NCI-H-1011) cell line has been examined for sequences 
homologous to those cloned in the rat. cDKA prepared from 
this RNA was subjected to PCR with the A4/B6 primer set and 
a DNA species of the appropriate size was seen. This DNA 
5 was subcloned and partially sequenced and clearly encodes a 
member of the olfactory protein family identified in the 
rat. 

The inserted sequence in human clones H3/H5 was amplified by 
10 PCR with the A4/B6 primers, gel purified, and then labeled 
with 32P. The labeled DNA was then hybridized to 
restriction enzyme human placenta. Multiple hybridizing 
species were observed with each DNA (See Figure 32) . This 
observation is consistent with the presence of a family of 
15 odorant receptor genes in the human genome. 

The sequence of clone H5 is hereby shown in Figure 19. In 
addition, the translated protein sequence is shown in Figure 

19. 

20 

In other to identify odorant receptors in other species, 
degenerated primer oligonucleotides homologous to conserved 
regions within the rat odorant receptor family may be used 
in PCR reactions with genomic DNA or with cDNA prepared from 
25 olfactory tissue RNA from those species. 

RESULTS 

Cloning the Gene Family 

30 A series of degenerate oligonucleotides were designated 
which could anneal to conserved regions of members of the 
superfamily of G-protein coupled seven transmembrane domain 
receptor genes. Fiv degenerate oliogonucleotides (Al-5; 
see Experimental Procedures) matching sequences within 

35 transmembrane domain 2, and six degenerate oligonucleotides 
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(Bl-6) matching transmembrane domain 7 were used in all 
combinations in PCR reactions to amplify homologous 
sequences in cOMA prepared from rat olfactory epithelium 
RNA. The amplification products of each PCR reaction were 
5 then analyzed by agarose gel electrophoresis. Multiple 
bands were observed with each of the primer combinations. 
The PCR products within the size range expected for this 
family of receptors (600 to 1300 bp) were subsequently 
picked and amplified further with the appropriate primer 
10 pair in order to isolate individual PCR bands. Sixty-four 
PCR bands isolated in this fashion revealed only one or a 
small number of bands upon agarose gel electrophoresis. 
Representatives of these isolated PCR products are shown in 
Figure 2A. 

15 

The isolated PCR products were digested with the 
endonuclease, Hae III or Hinf I, which recognize four base 
restriction sites and cut DNA at frequent intervals. In 
most instances, digestion of the PCR product with Hinf I 

20 generated a set of fragments whose molecular weights sum to 
the size of the original DNA (Figure 2B) . These PCR bands 
are therefore likely to each contain a single DNA species. 
In some cases, however, restriction digestion yielded a 
series of fragments whose molecular weights sum to a value 

25 greater than that of the original PCR product. The most 
dramatic example is shown in Figure 2 where the 710 bp, PCR 
13 DNA, is cleaved by Hinf I to yield a very large number of 
restriction fragments whose sizes sum to a value five- to 
ten-fold greater than that of the original PCR product. 

3 0 These observations indicated that PCR product 13 consists of 
a number of different species of DNA, each of which could be 
amplified with the same pair of primer oligonucleotides. In 
addition, when PCR experiments similar to those described 
were performed using cDNA library DNAs as templates, a 710 

35 bp PCR product was obtained with the PCR13 primer pair 
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(A4/B6) with DNA from olfactory cDNA libraries, but not a 
, glioma cDNA library. Moreover, digestion of one of this 710 
bp product also revealed the presence of multiple DNA 
species. In other cases (see PCR product 20, for example), 
5 digestion yielded a series of restriction fragments whose 
molecular weights also sum to a size greater than the 
starting material. Further analysis, however, revealed that 
the original PCR product consisted of multiple bands of 
similar but different sizes. 

10 

In order to determine whether the multiple DNA species 
present in PCR 13 encode members of a family of seven 
transmembrane domain proteins, PCR 13 DNA was cloned into 
the plasmid vector Bluescript and five individual clones 

15 were subjected to DNA sequence analysis. Each of the five 
clones exhibited a different DNA sequence, but each encoded 
a protein which displayed conserved features of the 
superf amily of seven transmembrane domain receptor proteins. 
In addition, the proteins encoded by all five clones shared 

20 distinctive sequence motifs not found in other superfamily 
members indicating they were all members of a new family of 
receptors. 

To obtain full-length cDNA clones, cDNA libraries prepared 
25 from olfactory epithelium RNA or from RNA of an enriched 
population of olfactory sensory neurons were screened. The 
probe used in these initial screens was a mixture of PCR 13 
DNA as well as DNA obtained by amplication of rat genomic 
DNA or DNA from two olfactory cDNA libraries with the same 
30 primers used to generate PCR 13 (A4 and B6 primers) . 
Hybridizing plaques were subjected to PCR amplification with 
the A4/B6 primer set and only those giving a PCR product of 
the appropriate size (approximately 710 bp) were purified. 
The frequency of such positive clones in the enriched 
3 5 olfactory neuron cDNA library was approximately five times 
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greater than the frequency in the olfactory epithelium cDNA 
library. The increased frequency of positive clones 
observed in the olfactory neuron library is comparable to 
the enrichment in olfactory neurons generally obtained in 
5 the purification procedure. 

The original pair of primers used to amplify PCR 13 DNA were 
then used to amplify coding segments of 20 different cDNA 
clones. A mix of these PCR products were labeled and used 

10 as probe for further cDNA library screens. This mixed probe 
was also used in a Northern blot (Figure 3) to determine 
whether the expression of the gene family is restricted to 
the olfactory epithelium. The mixed probe detects two 
diffuse bands centered at 2 and 5 kb in RNA from olfactory 

15 epithelium; no hybridization can be detected in brain or 
spleen. (Later experiments which examined a larger number 
of tissue RKAs with a more restricted probe will be shown 
below.) Taken together, these data indicate the discovery 
of a novel multigene family encoding seven transmembrane 

20 domain proteins which are expressed in olfactory epithelium, 
and could be expressed predominantly or exclusively in 
olfactory neurons. 

ThS. EJTPtein Sequences of Numerous, Olfactor y-specific 

25 Members of the Se ven Transmembrane Domain Superfamily 

Numerous clones were obtained upon screening cDNA libraries 
constructed from olfactory epithelium and olfactory neuron 
RNA at high stringency. Partial DNA sequences were obtained 

30 from 36 clones; 18 of these cDNA clones are different, but 
all of them encode proteins which exhibit shared sequence 
motifs indicating that they are members of the family 
identified in PCR 13 DNA. A complete nucleotide sequence 
was det rmined for coding regions of ten of the most 

35 divergent clones (Figure 4). The deduced protein sequences 
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of these cDNAs defines a new multigene family which shares 
sequence and structural properties with the super family of 
neurotransmitter and hormone receptors that traverse the 
membrane seven times. This novel family, however, exhibits 
5 features different from any other member of the receptor 
super family thus far identified. 

Each of the ten sequences contains seven hydrophobic 
stretches (19-26 amino acids) that represent potential 

10 transmembrane domains. These domains constitute the regions 
of maximal sequence similarity to other members of the seven 
transmembrane domain superfamily (see legend to Figure 4). 
On the basis of structural homologies with rhodopsin and the 
6-adrenergic receptors, (19) it is likely that the amino 

15 termini of the olfactory proteins are located on the 
extracellular side of the plasma membrane and the carboxyl 
termini are located in the cytoplasm. In this scheme, three 
extracellular loops alternate with three intracellular loops 
to link the seven transmembrane domains (see Figure 5) . 

20 Analysis of the sequences in figure 4 demonstrates that the 
olfactory proteins, like other members of the receptor 
superfamily, display no evidence of an N-terminal signal 
sequence. As in several other superfamily members, a 
potential N- linked glycosylation site is present in all ten 

25 proteins within the short N-terminal extracellular segment. 
Other structural features conserved with previously 
identified members of the superfamily included cysteine 
residues at fixed positions within the first and second 
extracellular loops that are thought to form a disulfide 

30 bond. Finally, many of the olfactory proteins reveal a 
conserved cysteine within the C-terminal domain which may 
serve as a palmitoylation site anchoring this domain to the 
membrane (21) . These features, taken together with several 
short, conserved sequence motifs (see legend to Figure 4) , 

35 clearly define this new family as a member of the 
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super family of genes encoding the seven transmembrane domain 
receptors. 

There are, however, important differences between the 
5 olfactory protein family and the other seven transmembrane 
domain proteins described previously and these differences 
may be relevant to proposed function of these proteins in 
odor recognition. Structure* function experiments involving 
in vitro mutagenesis suggest that adrenergic ligands 

10 interact with this class of receptor molecule by binding 
within the plane of the membrane (22, 20). Not 
surprisingly, small receptor families that bind the same 
class of ligands, such as the adrenergic and muscarinic 
acetylcholine receptor families exhibit maximum sequence 

15 conservation (often over 80%) within the transmembrane 
domains. In contrast, the family of receptors discussed in 
this application shows striking divergence within the third, 
fourth, and fifth transmembrane domains (Figure 4). The 
variability in the three central transmembrane domains is 

20 highlighted schematically in Figure 5. The divergence in 
potential ligand binding domains is consistent with the idea 
that the family of molecules cloned is capable of 
associating with a large number of odorant of diverse 
molecular structure. 

25 

Receptors which belong to the superfamily of seven 
transmembrane domain proteins interact with G-proteins to 
generate intracellular signals. 1b vitro mutagenesis 
experiments indicate that one site of association between 

30 receptor and G-protein resides within the third cytoplasmic 
loop (22, 23). The sequence of this cytoplasmic loop in 18 
different clones we have characterized is shown in Figure 
6A. This loop which is often quite long and of variable 
length in the receptor superfamily is relatively short (only 

35 17 amino acids) and of fixed length in the 18 clones 
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examined. Eleven of the 16 different clones exhibit the 
sequence motif K/R I V S S I (or a close relative) at the N- 
terminus of this loop. Two of the cDNA clones reveal a 
different HIT C/W A V motif at this site. If this short 
5 loop is a site of contact with G-proteins, it is possible 
that the conserved motifs may reflect sites of interaction 
with different G-proteins that activate different 
intracellular signalling systems in response to odors. In 
addition, the receptors cloned reveal several serine or 
10 threonine residues within the third cytoplasmic loop. By 
analogy with other G-protein coupled receptors, these 
residues may represent sites of phosphorylation for specific 
receptor kinases involved in desensitization. (24) 



15 Subfamilies within the Mul tiaene Family 



Figure 6A displays the sequences of the fifth transmembrane 
domain and the adjacent cytoplasmic loop encoded by L8 of 
the cDNA clones we have analyzed. As a group, the 18 

20 sequences exhibit considerable divergence within this 
region. The multigene family, however, can be divided into 
subfamilies such that the members of a given subfamily share 
significant sequence conservation. In subfamily B, clones 
F12 and F13, for example, differ from one another at only 

25 four of 44 positions (91% identify), and clearly define a 
subfamily. Clones F5 and 111 (subfamily D) differ from F12 
and F13 at 34-36 positions within this region and clearly 
define a separate subfamily. Thus, this olfactory-specific 
multigene family consists of highly divergent subfamilies. 

30 If these genes encode odor receptors, it is possible that 
members of the divergent subfamilies bind odorant of widely 
differing structural classes. Members of the individual 
subfamilies could therefore recognize more subtle 
differences between molecules which belong to the same 

35 structural class of molecules structures. 
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The Size of the Multiaene Family 

Genomic Southern blotting experiments were preformed and 
genomic libraries were screened to obtain an estimate of the 
5 sizes of the multigene family and the member subfamilies 
encoding the putative odor receptors. DNAs extending from 
the 3* end of transmembrane domain 3 to the middle of 
transmembrane domain 6 were synthesized by PCR from DNA of 
seven of the divergent cDNA clones (Figure 4). In initial 

10 experiments, these DNAs were labeled and hybridized to each 
other to define conditions under which minimal 
crosshybridization would be observed among the individual 
clones. At 70 °C, the seven DNAs showed no 

crosshybridization, or crosshybridized only very slightly. 

15 The trace levels of crosshybridization observed are not 
likely to be apparent upon genomic Southern blot analysis 
where the amounts of DNA are far lower than in the test 
cross . 

20 Probes derived from these seven DNAs were annealed under 
stringent conditions, either individually or as a group, to 
Southern blots of rat liver DNA digested with the 
restriction endonucleases Eco RI or Hind III (Figure 7). 
Examination of the Southern blots reveals that all but one 

25 of the cDNAs detects a relatively large, distinctive array 
of bands in genomic DNA. Clone 115 (probe 7) , for example, 
detects about 17 bands with each restriction endonuclease, 
whereas clone F9 (probe l) detects only about 5-7 bands with 
each enzyme. A single band is obtained with clone 17 (probe 

30 5) . PCR experiments using nested primers (TM2/TM7 primers 
followed by primers to internal sequences) and genomic DNA 
as template indicate that the coding regions of the members 
of this multigene family, like those of many members of the 
G-protein coupled super family, may not be interrupted by 

35 introns. This observation, together with the fact that most 
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of the probes only encompasses 400 nucleotides suggests that 
each band observed in these experiments is likely to 
represent a different gene. These data suggest that the 
individual probes chosen are representatives of subfamilies 
5 which range in size from a single member to as many as 17 
members. A total of about 70 individual bands were detected 
in this analysis which could represent the presence of at 
least 70 different genes. Although the DMA probes used in 
these blots did not crosshybridize appreciably with each 

10 other, it is possible that a given gene might hybridize to 
more than one probe, resulting in an overestimate of gene 
number. However, it is probable that the total number of 
bands only reflects a minimal estimate of gene number since 
it is unlikely that we have isolated representative cDNAs 

15 from all of the potential subfamilies and the hybridizations 
were performed under conditions of very high stringency. 

A more accurate estimate of the size of the olfactory- 
specific gene family was obtained by screening rat genomic 

20 libraries. The mix of the seven divergent probes used in 
Southern blots, or the mix of 20 different probes used in 
our initial Northern blots (see Figure 3), were used as 
hybridization probes under high (65°C) or lowered (55°C) 
stringency conditions in these experiments. Nested PCR (see 

25 above) was used to verify that the clones giving a positive 
signal under low stringency annealing conditions were indeed 
members of this gene family. It is estimated from these 
studies that there are between 100 and 200 positive clones 
per haploid genome. The estimate of the size of the family 

3 0 obtain from screens of genomic libraries again represents a 
lower limit. Given the size of the multigene family, one 
might anticipate that many of these genes are linked such 
that a given genomic clone may contain multiple genes. Thus 
the data from Southern blotting and screens of genomic 

35 libraries indicate that the multigene family identified 
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consists of one to several hundred member genes which can be 
divided into multiple subfamilies. 



It should be noted that the cDNA probes isolated may not be 
5 representative of the full complement of subfamilies within 
the larger family of olfactory proteins. The isolation of 
cDNAs, for example, relies heavily on PGR with primers from 
transmembrane domains 2 and 7 and biases our clones for 
homology within these regions. Thus, estimates of gene 
10 number as well as subsequent estimates of RKA abundance 
should be considered as minimal. 



Expression of the Members of this Multiaene Family 

15 Additional Northern blot analyses were preformed to 
demonstrate that expression of the members of this gene 
family is restricted to the olfactory epithelium. (Figure 8) 
Northern blot analysis with a mixed probe consisting of the 
seven divergent cDNAs used above reveals two diffuse bands 

20 about 5 and 2 kb in length in olfactory epithelium RKA. 
This pattern is the same as that seen previously with the 
mix of 20 DNAs. No annealing is observed to RNA from the 
brain or retina or other, nonneural tissues, including lung, 
liver, spleen, and kidney. 

25 

An estimate of the level of expression of this family can be 
obtained from screens of cDNA libraries. The frequency of 
positive clones in cDNA libraries made from olfactory 
epithelium RNA suggests that the abundance of the RNAs in 

30 the epithelium is about one in 20,000. The frequency of 
positive clones is approximately five-fold higher in a cDNA 
library prepared from RNA from purified olfactory neurons 
(in which 75% of the cells are olfactory neurons) . The 
increased frequency of positive clones obtained in the 

35 olfactory neuron cDNA library is comparable to the 
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enrichment we obtain upon purification of olfactory neurons. 
These observations suggest that this multigene family is 
expressed largely, if not solely, in olfactory neurons and 
may not be expressed in other cell types within the 
5 epithelium. If each olfactory neuron contains 10 5 mRNA 
molecules, from the frequency of positive clones we predict 
that each neuron contains only 25-30 transcripts derived 
from this gene family. Since the family of olfactory 
proteins consists of a minimum of a hundred genes, a given 
10 olfactory neuron could maximally express only a proportion 
of the many different family members. These values thus 
suggest that olfactory neurons will exhibit significant 
diversity at the level of expression of these olfactory 
proteins. 

15 

Identification of pheromone receptors in vomeronasal organ 
The vomeronasal organ (vomeronasal gland) is an accessory 
olfactory structure that is located near the nasal cavity. 
Like the olfactory epithelium of the nasal cavity, the 

20 olfactory epithelium of the vomeronasal organ contains 
olfactory sensory neurons. The vomeronasal organ is 
believed to play an important role in the sensing of 
pheromones in numerous species. Pheromone s are believed to 
have profound effects on both physiological and behavioral 

25 aspects of reproduction, the identification of pheromone 
receptors would permit the identification of the pheromones 
themselves. It would also enable one to identify agonists or 
antagonists that would either mimic the pheromones or block 
the pheromone receptors from transducing pheromone signals. 

30 Such information would be important to the development of 
species specific pesticides and, conversely, to animal 
husbandry. The identification of pheromone receptors in 
human could ultimately lead to the development of 
contraceptives or to treatments for infertility in humans. 

35 It is likely that the identification of pheromone receptors 
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in low mammals such as rodents would lead to the 
identification of similar receptors in human. 

In order to identify potential pheromone receptors, we 
5 isolate RNA from the vomeronasal organs of female rats and 
prepared cDNA from this RNA. The cDNA was subjected to PCR 
with several different pairs of degenerate oligonucleotide 
primers that match sequences present in the rat odorant 
receptor family. The PCR products were subcloned and the 

10 nucleotide sequences of the subcloned DNAs were determined. 
Each of the subcloned DNAs encodes a protein that belongs to 
the odorant receptor family. The sequences of the following 
vomeronasal subclones are shown: Jl, J2, J4, J7, J8, Jll, 
J14, J15, J16, J17, J19, J20. In a few cases (J2, J4), the 

15 same sequence was amplified with two different primer pairs 
and the sequence shown is a composite of the two sequences. 
It is possible that one or more of these molecules, or 
closely related molecules, serve as pheromone receptors in 
the rat. 



20 



DISCUSSION 



The mammalian olfactory system can recognize and 
discriminate a large number of odorous molecules. 

25 Perception in this system, as in other sensory systems, 
initially involves the recognition of external stimuli by 
primary sensory neurons. This sensory information is then 
transmitted to the brain where it is decoded to permit the 
discrimination of different odors. Elucidation of the logic 

30 underlying olfactory perception is likely to require the 
identification of the specific odorant receptors, the 
analysis of the extent of receptor diversity and receptor 
specificity, as well as an understanding of the pattern of 
receptor expression in the olfactory epithelium. 

35 
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The odorant receptors are thought to transduce intracellular 
signals by interacting with G-proteins which activate second 
messenger systems (12, 13, 14, 15). These proteins are 
clearly members of the family of G-protein coupled receptors 
5 which traverse the membrane seven times (19) . The odorant 
receptors should be expressed specifically in the tissue in 
which odorant are recognized. The family of olfactory 
proteins cloned is expressed in the olfactory epithelium* 
Hybridizing RNA is not detected in brain or retina, or in a 

10 host of nonneural tissues. Moreover, expression of this 
gene family the epithelium may be restricted to olfactory 
neurons. The family of odorant receptors must be capable of 
interacting with extremely diverse molecular structures. 
The genes cloned are members of any extremely large 

15 multigene family which exhibit variability in regions 
thought to be important in ligand binding. The possibility 
that each member of this large family of seven transmembrane 
proteins is capable of interacting with only one or a small 
number of odorant provides a plausible mechanism to 

20 accommodate the diversity of odor perception. The 
properties of the gene family identified suggests that this 
family is likely to encode a large number of distinct 
odorant receptors. 

25 Size of the Multio ene Family 

The size of the receptor repertoire is lilcely to reflect the 
range of detectable odors and the degree of structural 
specificity exhibited by the individual receptors. It has 

30 been estimated that humans can identify over 10,000 
structurally-distinct odorous ligands. However, this does 
not necessarily imply that humans possess an equally large 
repertoire of odorant receptors. For example, binding 
studies in lower vertebrates suggest that structurally- 

35 related odorant may activate the same receptor molecules. 
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In fish which smell amino acids, the binding of alanine to 
isolated cilia can be competed by other small polar residues 
(threonine and serine) , but not by the basic amino acids, 
lysine or arginine (11) . These data suggest that individual 
5 receptors are capable of associating with several 
structurally-related ligands, albeit with different 
affinities. Stereochemical models of olfactory recognition 
in mammals (25) (based largely on psychophysical, rather 
than biophysical data) have suggested existence of several 
10 primary odor groups including camphoraceous , musky, 
peppermint, ethereal, pungent, and putrid. In such a model, 
each group would contain odorant with common molecular 
configurations which bind to common receptors and share 
similar odor qualities. 

15 

Screens of genomic libraries with mixed probes consisting of 
divergent family members detect approximately 100 to 200 
positive clones per genome. The present estimate of at 
least 100 genes provides only a lower limit since it is 

20 likely that the probes used do not detect all of the 
possible subfamilies* Moreover, it is probable that many of 
these genes are linked such that a given genomic clone may 
contain multiple genes. It is therefore expected that the 
actual size of the gene family may be considerably higher 

25 and this family of putative odorant receptors could 
constitute one of the largest gene families in the genome. 

The characterization of a large multigene family encoding 
putative odorant receptors suggests that the olfactory 

3 0 system utilizes a far greater number of receptors than the 
visual system. Color vision, for example, allows the 
discrimination of several hundred hues, but is accomplished 
by only three different photoreceptors (1, 2, 3 and 4). The 
photoreceptors each have different, but overlapping 

3 5 absorption spectra which cover the entire spectrum of 
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visible wavelengths. Discrimination of color results from 
comparative processing of the information from these three 
classes of photoreceptors in the brain. Whereas three 
photoreceptors can absorb light across the entire visible 
5 spectrum, our data suggest that a small number of odorant 
receptors cannot recognize and discriminate the full 
spectrum of distinct molecular structures perceived by the 
mammalian olfactory system. Rather, olfactory perception 
probably employs an extremely large number of receptors each 
10 capable of recognizing a small number of odorous ligands. 

Diversity within the Gene Family and the Specificity of Odor 

fiecogni.Uoo 

15 The olfactory proteins identified in this application are 
clearly members of the superfamily of receptors which 
traverse the membrane seven time. Analysis of the proteins 
encoded by the 18 distinct cDNAs we have cloned reveals 
structural features which may render this family 

20 particularly well suited for the detection of a diverse 
array of structurally distinct odorant. Experiments with 
other members of this class of receptors suggest that the 
ligand binds to its receptor within the plane of the 
membrane such that the ligand contacts many, if not all of 

25 the transmembrane helices. The family of olfactory proteins 
can be divided into several different subfamilies which 
exhibit significant sequence divergence within the 
transmembrane domains. Nonconservative changes are commonly 
observed within blocks of residues in transmembrane regions 

30 3, 4, and 5 (Figures 4, 5, 6); these blocks could reflect 
the sites of direct contact with odorous ligands. Some 
members, for example, have acidic residues in transmembrane 
domain 3, which in other families are thought to be 
essential for binding aminergic ligands (20) while other 

35 members maintain hydrophobic residues at these positions. 
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This divergence within transmembrane domains may reflect the 
fact that the members of the family of odorant receptors 
must associate with odorant of widely different molecular 
structures . 

5 

These observations suggest a model in which each of the 
individual subfamilies encode receptors which bind distinct 
structural classes of odorant. Within a given subfamily, 
however, the sequence differences are far less dramatic and 

10 are often restricted to a small number of residues. Thus, 
the members of a subfamily may recognize more subtle 
variations among odor molecules of a given structural class. 
At a practical level, individual subfamilies may recognize 
grossly different structures such that one subfamily may 

15 associate, for example, with the aromatic compound, benzene 
and its derivatives, whereas a second subfamily may 
recognize odorous, short chain, aliphatic molecules. Subtle 
variations in the structure of the receptors within, for 
example, the hypothetical benzene subfamily could facilitate 

20 the recognition and discrimination of various substituted 
derivatives such as toluene, xylene or phenol. It should be 
noted that such a model, unlike previous stereochemical 
models, does not necessarily predict that molecules with 
similar structures will have similar odors. The activation 

25 of distinct receptors with similar structures could elicit 
different odors, since perceived odor will depend upon 
higher order processing of primary sensory information. 



30 



ion of the Gene Family and the Generation of Div 



Preliminary evidence from PCR analyses suggests that members 
of this family of olfactory proteins are conserved in lower 
vertebrates as well as invertebrates. This gene family 
presumably expanded over evolutionary time providing mammals 
35 with the ability to recognize an increasing diversity of 
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odorant. Examination of the sequences of the family members 
cloned from mammals provides some insight into the evolution 
of this multigene family. Although the chromosomal loci 
encoding these genes has yet to be characterized, it is 
5 likely that at least some member genes will be tandemly 
arranged in a large cluster as is observed with other large 
multigene families. A tandem array of this sort provides a 
template for recombination events including unequal crossing 
over and gene conversion, that can lead to expansion and 
10 further diversification of the sort apparent among the 
family members we have cloned (26) . 



The multigene family encoding the olfactory proteins is 
large: all of the member genes clearly have a common 

15 ancestral origin, but have undergone considerable divergence 
such that individual genes encode proteins that share from 
4 0-80% amino acid identity. Subfamilies are apparent with 
groups of genes sharing greater homology among themselves 
than with members of other subfamilies. Examination of the 

20 sequences of even the most divergent subfamilies, however, 
reveals a pattern in which several blocks of conserved 
residues are interspersed with variable regions. This 
segmental homology is conceptually similar to the 
organization of framework and hypervariable domains within 

25 the families of immunoglobulin and T cell receptor variable 
region sequences (27, 28). This analogy goes beyond 
structural organization and may extend to the function of 
these two gene families: each family consists of a large 
number of genes which have diversified over evolutionary 

30 time to accommodate the binding of a highly diverse array of 
ligands. The evolutionary mechanisms responsible for the 
diversification and maintenance of these large gene families 
may also be similar. It has been suggested that gene 
conversion has played a major role in the evolution of 

35 immunoglobulin and T cell receptor variable domains (29, 30 
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and 31) . Analysis of the sequence of the putative olfactory 
receptors reveals at least one instance where a motif from 
a variable region of one subfamily is found imbedded in the 
otherwise divergent sequence of a second subfamily, 
5 suggesting that conversion has occurred. Such a mixing of 
motifs from one subfamily to another over evolutionary time 
would provide additional combinatorial possibilities leading 
to the generation of diversity. 

10 It should be noted, however, that the combinatorial joining 
of gene segments by DNA rearrangement during development, 
which is characteristic of immunoglobulin loci (27), is not 
a feature of the putative odor receptor gene family. No 
evidence for DNA rearrangement to generate the diversity of 

15 genes cloned has been observed. The entire coding region 
has been sequenced along with parts of the 5' and 3* 
untranslated regions of 10 different cDNA clones. The 
sequences of the coding regions are all different; no 
evidence has been obtained for constant regions that would 

20 suggest DNA rearrangement of the sort seen in the immune 
system. The observations indicate that the diversity 
olfactory proteins are coded by a large number of distinct 
gene sequences. 

25 Although it is unlikely from the data that DNA rearrangement 
is responsible for the generation of diversity among the 
putative odorant receptors, it remains possible that DNA 
rearrangements may be involved in the regulation of 
expression of this gene family. If each olfactory neuron 

30 expresses only one or a small number of genes, then a 
transcriptional control mechanism must be operative to 
choose which of the more than one hundred genes within the 
family will be expressed in a given neuron. Gene conversion 
from one of multiple silent loci into a single active locus, 

35 as observed for the trypanosome-variable surface 
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glycoproteins (32) , provides one attractive model. The gene 
conversion event could be stochastic, such that a given 
neuron could randomly express any one of several hundred 
receptor genes, or regulated (perhaps by positional 
5 information) , such that a given neuron could only express 
one or a small number of predetermined receptor types. 
Alternatively, it is possible that positional information in 
the olfactory epithelium controls the expression of the 
family of olfactory receptors by more classical mechanisms 

10 that do not involve DNA rearrangement. What ever mechanisms 
will regulate the expression of receptor genes within this 
large, multigene family, these mechanisms must accommodate 
the requirement that olfactory neurons are regenerated every 
30-60 days (8) and therefore the expression of the entire 

15 repertoire of receptors must be accomplished many times 
during the life of an organism. 

Receptor Diversity and the Central Proc essing of Olfactory 
Information 

20 

The results suggest the existence of a large family of 
distinct odorant receptors. Individual members of this 
receptor family are likely to be expressed by only a small 
set of the total number of olfactory neurons. The primary 

2 5 sensory neurons within the olfactory epithelium will 

therefore exhibit significant diversity at the level of 
receptor expression. The question then emerges as to 
whether neurons expressing the same receptors are localized 
in the olfactory epithelium. Does the olfactory system 
30 employ a topographic map to discriminate among the numerous 
odorant? The spatial organization of distinct classes of 
olfactory sensory neurons, as defined by receptor 
expression, can now be determined by using the procedures of 
in situ hybridization and immunohistochemistry with probes 

3 5 specific for the individual receptor subtypes. This 
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information should help to distinguish between different 
models that have been proposed to explain the coding of 
diverse odorant stimuli (33) . 

5 In one model, sensory neurons that express a given receptor 
and respond to a given odorant may be localized within 
defined positions within the olfactory epithelium. This 
topographic arrangement would also be reflected in the 
projection of olfactory sensory axons into discrete regions 
10 (glomeruli) within the olfactory bulb. In this scheme, the 
central coding to permit the discrimination of discrete 
odorant would depend, in part, on the spatial segregation of 
different receptor populations. Attempts to discern the 
topographic localization of specific receptors at the level 
15 of the olfactory epithelium has led to conflicting results. 
In some studies, electrophysiological recordings have 
revealed differences in olfactory responses to distinct 
odorant in different regions of the olfactory epithelium 
(34, 35), However, these experiments have been difficult to 
20 interpret since the differences in response across the 
epithelium are often small and are not observed in all 
studies (36). 

A second model argues that sensory neurons expressing 
distinct odorant receptors are randomly distributed in the 
epithelium but that neurons responsive to a given odorant 
project to restricted regions within the olfactory bulb. In 
this instance, the discrimination of odors would be a 
consequence of the position of second order neurons in the 
olfactory bulb, but would be independent of the site of 
origin of the afferent signals within the epithelium. 
Mapping of the topographic projections of olfactory neurons 
has been performed by extracellular recordings from 
different regions of the bulb (37, 38) and by 2-deoxyglucose 
autoradiography to map regional activity after exposure to 
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different odor ant (39). These studies suggest that 
spatially -localized groups of bulbar neurons preferentially 
respond to different odorant. The existence of specific 
odorant receptors, randomly distributed through the 
5 olfactory epithelium, which converge on a common target 
within the olfactory bulb, would raise additional questions 
about the recognition mechanisms used to guide these 
distinct axonal subsets to their central targets. 

10 Other sensory systems also spatially segregate afferent 
input from primary sensory neurons. The spatial segregation 
of information employed/ for example, by the visual and 
somatosensory systems, is used to define the location of the 
stimulus within the external environment as well as to 

15 indicate the quality of the stimulus. In contrast, 
olfactory processing does not extract spatial features of 
the odorant stimulus. Relieved of the necessity to encode 
information about the spatial localization of the sensory 
stimulus, it is possible that the olfactory system of 

20 mammals uses the spatial segregation of sensory input solely 
to encode the identity of the stimulus itself. The 
molecular identification of the genes likely to encode a 
large family of olfactory receptors should provide initial 
insights into the underlying logic of olfactory processing 

25 in the mammalian nervous system. 
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(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory apithaliua 

(B) STRAIN: Srpagua-Dawlay rat 

{?) TISSUE TYPE: olfactory apLthalium 

i 

(vii) IMMEDIATE SOURCE J 
(B) CLONE: F3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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ATCCACTCAA 


GCAACAGGAC 


AAGACTTTCA 


GAATTTCTTC 


TTCTTGGATT 


TGTAGAAAAC 


60 


AAAGACCTAC 


AACCCCTTAT 


TTATGGTCTT 


TTTCTCTCTA 


TGTACCTGGT 


TACTGTCATT 


120 


GG AAA CAT AT 


CCATTATTGT 


CGCTATCATT 


TCACATCCCT 


GTCTGCAGAC 


CCCCATGTAT 


180 


TTCTTCCTCT 


CTAACCTGTC 


CTTTGTGCAC 


ATCTGTTTCA 


TTTCAACCAC 


TGTTCCAAAC 


240 


ATGTTAGTGA 


ACATCCAGAC 


CCAAAACAAT 


GTCATCACCT 


ATGCAGGATG 


CATTACCCAG 


300 


ATATACTTTT 


TCTTGCTCTT 


TGTACAATTG 


GACAACTTCT 


TGCTGACTAT 


CATGCCCTAT 


360 


GACCGTTACG 


TAGCCATCTG 


TCACCCCATG 


CACTACACAG 


TTATCATGAA 


CTACAAGCTC 


420 


TGTGGATTTC 


TGGTTCTGGT 


ATCTTGGATT 


GTAAGTGTTC 


TGCATGCCTT 


GTTTCAAAGC 


460 


TTGATGATGT 


TGGCGCTGCC 


CTTCTGCACA 


CATCTGCAAA 


TCCCACACTA 


CTTCTGTCAA 


S40 


CCTAATCAGG 


TGATTCAACT 


CACCTGTTCT 


GATCCATTTC 


TTAATGATCT 


TGTCATATAT 


600 


TTTACACTTG 


TGCTGCTGGC 


TACTGTTCCT 


CTTGCTGGCA 


TCTTCTATTC 


TTACTTCAAG 


660 


ATAGTGTCCT 


CCATATGTGC 


TATATCGTCA 


GTTCATGGGA 


AGTACAAAGC 


ATTCTCCACC 


720 


TGTGCATCTC 


ACCTTTCAGT 


CGTGTCTTTA 


TTTTACTGCA 


CAGGACTAGC 


AGTGTACCTC 


780 


AGTTCTGCTG 


CAAACAACAG 


CTCACAGGCA 


AGTGCCACAG 


CCTCACTCAT 


GTACACTGTA 


840 


CTTACCCCTA 


TGGTGAACCC 


TTTTATCTAT 


AGTCTTAGGA 


ATAAAGATGT 


TAAGAGTGTT 


900 


CTGAAAAAAA 


CTCTTTGTGA 


GGAAGTTATA 


AGGAGTCCAC 


CTTCCCTACT 


TCATTTCTTC 


960 


CTACTGTTAT 


CTCATCTCCC 


TTGTTTTATT 


TTTTCTTATT 


AA 




1002 



(2) INFORMATION FOR SEQ ID HO:3i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairt 

(B) TYPE i nucleic acid 

(C) STRAND ID NESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO 

<vi) ORIGINAL SOURCE 1 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpagua-Dawley rat 

{7) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: P5 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATGAGCAGCA CCAACCAGTC CACTGTCACC GAGTTCCTCC TCCTGGGACT CTCCAGGCAG 60 
CCCCAGCAGC AGCAGCTCCT CTTCCTGCTC TTCCTCATCA TGTACCTGGC CACTGTCCTG 120 
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CCAAACCTGC TCATCATCCT GGCTATTGGC ACACACTCCC CCCTCCACAC CCCCATCTAC 
TTCTTCCTCA GTAACCTGTC CTTTCTCCAT CTCTGCTTCT CCTCTACCAC TGTCCCTAAA 
CTTCTGCCCA ACCATATACT TCCCACTCAC GCCATTTCCT TCTCTCGGTG TCTCACCCAC 
CTCTATTTTC TCGCTCTCTT TGGTAACATG CACAATTTCC TCCTCGCTCT CATGTCCTAT 
CACCGATTTG TCCCCATATO CCACCCTTTA CACTACACAA CAAAGATCAC CCGTCAGCTC 
TCTGTCCTCC TTGTTGTGGC CTCATCGGTT GTAGCCAACA TCAATTGTCT GTTGCACATA 
CTCCTCATCC CTCGACTCTC CTTCTGTGCA GACAACATGA TCCCCCACTT CTTCTGTGAT 
CCAACTCCCC TCCTCAAACT CTCCTCCTCA CACACACATC TCAATCACCT GATGATTCTT 
ACAGAGGCAC CTGTGGTCAT GGTCACCCCA TTTGTCTGCA TCCTCATCTC CTACATCCAC 
ATCACCTGTG CTCTCCTCAG ACT CT CATC C CCCAGGGGAG GATGCAAATC CTTCTCCACC 
TGTGGCTCCC ACCTGGCTGT GGTCTCCCTC TTCTATGGCA CCGTCATCGC TGTGTATTTC 
AACCCATCAT CCTCTCACTT AGCTGGGAGC GACATGGCAG CTGCAGTGAT GTATCCAGTG 
GTGACCCCAA TGCTGAACCC TTTCATCTAT AGCCTCAGGA ACAGCGACAT CAAAGCAGCT 
TTAAGGAAAG TGCTCGCCAT GAGATTTCCA TCTAAGCAGT AA 
(2) INFORMATION FOR SBQ ID NO: 4: 

(i) SEQUBNCS CHARACTERISTICS: 

(A) LENGTH: 936 baaa paira 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS i aingla 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPtt cONA 
(iii) HYPOTHETICAL t YES 
(iv> ANT I -SENSE t NO 

<vi) ORIGINAL SOURCE i 

(A) ORGANISM t cat olfactory apithaliua 

(B) STRAIN i Srpagua-Dawlay rat 

(?) TISSUE TYPE: olfactory •pithalium 

(vli) IMMEDIATE SOURCE: 
(B) CLONE: P6 



180 

240 

300 

360 

420 

460 

540 

600 

660 

720 

780 

840 

900 

942 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
ATGGCTTGGA GTACTGGCCA GAACCTCTCC AC A CC AGO AC CATTCATCTT GCTGGCCTTC 
CCAGGGCCAA GGAGCATCCC CATTGGGCTC TTCCTCCTTT TCCTGGTCAT CTATCTGCTT 
ACGGTAGTTG GAAACCTAGC CATCATCTCC CTGGTAGGTG CCCACAGATG CCTACAGACA 
CCCATGTACT TCTTCCTCTG CAACCTCTCC TTCCTGGAGA TCTGGTTCAC CACAGCCTGC 
GTACCCAAGA CCCTGGCCAC ATTTGCGCCT CGGGGTGGAG TCATTTCCTT GGCTGGCTGT 



60 
120 
180 
240 
300 
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GCCACACAGA TCTACTTTCT CTTTTCTTTC GCCTGTACOC ACTACTTCCT CCTGCCTCTG 360 

ATGGCTTATC ACCCCTACCT CCCCATCTCC CTGCCACTCC GCTATCCTGG CATCATCACT 420 

CCTGGCCTCG CGATGCGGTT GGCCCTGCGA TCCTGCCTCT GTGCGTTTTC TCCAATCACA 480 

CTTCCTCCTA CCCTCATTCC CCCCCTCTCT TTCTCTCCCT CACCTGTCAT CAACCACTTC 540 

TTCTCTGACA TTTCCCCCTO GATACTCCTT TCCTGCACCC ACACGCAGCT OCTOCAACTC 600 

CTCTCCTTTG GCATTGCCTT CTGTGTTATT CTGCGCTCGT GTCCTATCAC ACTACTCTCC 660 

TATCCTTACA TCATCACTAC CATCATCAAC ATTCCCTCTC CCCGOGCCCG CCACCCCCCC 720 

TTCTCAACCT GCTCATCCCA TCTCACTCTC CTCCTOATTT GGTATCGCTC CACCATCTTC 780 

TTGCATGTCA GCACCTCGCT AGAGAGCTCC TTGGACCTCA CCAAAGCTAT CACACTGCTC 840 

AACACCATTC TCACACCTCT GCTCAACCCT TTCATATATA CTCTCAGGAA CAAGCATGTC 900 

AAGGAAGCTC TGCGCAGGAC GGTGAAGGGG AAGTGA 936 
(2) INFORMATION FOR SEQ ID NO: 5: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 939 basa pairs 

(B) TYPE i nucltic *c id 

(C) STRANDEDNESS : aingla 

(D) TOPOLOGY t linaar 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

<vi) ORIGINAL SOURCE i 

(A) ORGANISM: rat olfactory •pithelium 
(8) STRAIN i Srpagua-Dawlay rat 

(F) TISSUE TYPEi olfactory apithaliua 

(vii) IMMEDIATE SOURCE: 

(B) CLONE t 114 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOtSi 



ATGACTGCAA 


ATAACCAAAC 


TTTGATCTTG 


GACTTCCTCC 


TCCTGCCTCT 


GCCCATCCCA 


60 


TCAGAGTATC 


ATCTCCTGTT 


CTATGCCCTG 


TTCCTGGCCA 


TGTACCTCAC 


CATCATCCTC 


120 


GGAAACCTGC 


TAATCATTGT 


CCTTGTTCCA 


CTGGACTCTC 


ATCTCCACAT 


GCCCATGTAC 


180 


TTGTTTCTCA 


GCAACTTGTC 


CTTCTCTCAC 


CTCTGCTTTT 


CCTCTGTCAC 


AATCCCCAAA 


240 


TTCCTTCAGA 


ACATGCAGAG 


CCAAGTACCA 


TCTATATCCT 


ATACAGGCTG 


CCTGACACAG 


300 


CTGTACTTCT 


TTATGGTTTT 


TGGAGATATG 


CAGAGCTTCC 


TTCTTGTCCT 


CATGGCCTAT 


360 


GACCCCTATC 


TGGCCATTTG 


CTTTCCTTTG 


CGTTACACCA 


CCATCATGAC 


CACCAAGTTC 


420 


TGTGCTTCAC 


TACTGCTACT 


TCTGTGGATG 


CTGACGATGA 


CCCATGCCCT 


GCTGCATACC 


480 
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CTACTCATTG CTAGATTCTC TTTTTGTCAG AAGAATCTGA TTCTTCACTT TTTCTCTCAC 540 

ATTTCTGCTC TTCTGAAGTT GTCCTGCTCA GACATTTATG TTAATGAGCT GATGATATAT 600 

ATCTTCGGTG GACTCATCAT TATTATCCCA TTCCTATTAA TTCTTATGTC CTATGTTAGA 660 

ATTTTCTTCT CCATTTTGAA GTTTCCATCT ATTCAGGACA TCTACAAGCT ATTCTCAACC 720 

TCTGCTTCCC ATCTGTCTGT GGTGACCTTG TTTTATCGGA CAATTTTTGG TATCTACTTA 780 

TGTCCATCAG GTAATAATTC TACTGTGAAG GAGATTGCCA TGCCTATCAT GTACACACTC 640 

GTGACTCCCA TGCTGAATCC CTTCATCTAC AGCCTGAGGA ACAGAGACAT GAAAAGGGCC 900 

CTAATAAGAG TTATCTGCAC TAAGAAAATC TCTCTGTAA 939 
(2) INFORMATION FOR SEQ ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 945 ba«» pairs 

(B) TYPE: nucl«ic acid 

(C) STRANDEDNESS: aingla 
{0) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: YES 

<iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory apithalium 

(B) STRAIN: Srp«gua-Dawl«y rat 

(F) TISSUE TYPE: olfactory •pithalium 

(vil) IMMEDIATE SOURCE: 
(B) CLONE: 1X5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOt6: 



ATGACAGAAG 


AGAACCAAAC 


TGTGATCTCC 


CAGTTCCTTC 


TCCTTTTCCT 


GCCCATCCCC 


60 


TCAGAGCACC 


AGCACCTCTT 


CTACCCCCTG 


TTCCTGTCCA 


TGTACCTCAC 


CACTGTCCTG 


120 


GGGAACCTCA 


TCATCATCAT 


CCTCATTCAC 


CTGGACTCCC 


ATCTCCACAC 


ACCCATCTAC 


180 


TTGTTTCTCA 


GCAACTTGTC 


CTTCTCTGAT 


CTCTGCTTTT 


CCTCTGTTAC 


GATGCCCAAG 


240 


TTGTTGCAGA 


ACATGCAGAG 


CCAAGTTCCA 


TCCATCCCCT 


TTGCAGGCTG 


CCTGACACAA 


300 


TTATACTTTT 


ACCTGTATTT 


TCCAGACCTT 


GAGAGCTTCC 


TGCTTGTCCC 


CATGCCCTAT 


360 


CACCCCTATC 


TCGCCATCTG 


CTTCCCCCTT 


CATTACATGA 


GCATCATGAC 


CCCCAAGCTC 


420 


TGTGTGAGTC 


TGGTGGTCCT 


GTCCTGGGTG 


CTGACCACCT 


TCCATGCCAT 


GCTGCACACC 


480 


CTGCTCATGG 


CCAGATTGTC 


ATTCTGTGCG 


GACAATATGA 


TCCCCCACTT 


TTTCTGTGAT 


540 


ATATCTCCTT 


TATTGAAACT 


CTCCTGCTCT 


GACACCCATG 


TTAATGACTT 


GCTCATATTT 


600 
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CTCATGGGAG GCCTTCTTAT TCTCATTCCA TTTGTCCTCA TCATTCTATC TTATCCACCA 



660 



GTTGTCCCCT CCATTCTTAA AGTCCCTTCT CTCCCAGCCA TCCACAACAT CTTCTCCACC 



720 



TGCGGCTCCC ATCTCTCTCT GCTGTCACTC TTCTATGGGA CAATCATTGG TCTCTACTTA 



780 



TGTCCGTCAC CTAATAACTC TACTGTGAAG GAGACTCTCA TGGCCATGAT GTACACAGTG 



840 



GTCACCCCCA TGCTGAACCC CTTCATCTAC AGCCTGAGCA ACAGAGACAT GAAAGAGGCA 



900 



CTGATAAGAG TCCTTTGTAA AAAGAAAATT ACCTTCTGTC TATGA 



945 



(2) INFORMATION FOR SEQ ID NOj7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 933 baa* pairs 

(B) TYPBi nuclaic acid 

(C) STRANDED NESS : eingla 

(D) TOPOLOGY: linaar 

(11) MOLECULE TYPE: CDNA 
< ill) HYPOTHETICAL: YES 
(iv) ANTI-SENSE; NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN : Srpagu«-Dawl«y rat 

<F) TISSUE TYPE: olfactory •pithalium 

(vii) IMMEDIATE SOURCE: 
( B) CLONE: 13 

(xl) SEQUENCE DESCRIPTION: SEQ XD NO; 7: 

ATGAACAATC AAACTTTCAT CACCCAATTC CTTCTCCTGG CACTGCCCAT CCCTGAAGAA 60 

CATCAGCACC TCTTCTATGC CTTGTTCCTG GTCATGTACC TCACCACCAT CTTGGGAAAC 120 

TTGCTAATCA TTCTACTTCT TCAACTCCAC TCCCAGCTCC ACACACCTAT GTATTTGTTT 180 

CTCAGCAATT TGTCTTTCTC TGATCTATGT TTTTCCTCTG TCACAATGCC CAAGCTGCTC 240 

CAGAACATGA GGAGCCAOGA CACATCCATT CCCTATGGAG GCTGCCTCGC ACAAACATAC 300 

TTCTTTATGG TTTTTGGAGA TATGGACACT TTCCTTCTTG TGGCCATGGC CTATCACCGC 360 

TATGTGCCCA TCTGCTTCCC TCTGCATTAC ACCAGCATCA TGAGCCCCAA CCTCTGTACT 420 

TGTCTAGTGC TGTTATTGTG GATGCTGACG ACATCCCATG CCATGATGCA CACACTGCTT 480 

GCAGCAAGAT TGTCTTTTTC TGAGAACAAT GTCCTCCTCA ACTTCTTCTG TGACCTATTT 540 

GTTCTCCTAA AGCTGGCCTG CTCACACACT TATATTAATG ACTTGATGAT ATTTATCATG 600 

AGTACACTCC TCATTATTAT TCCATTCTTC CTCATTGTTA TGTCCTATGC AAGGATCATA 660 

TCCTCTATTC TTAAGGTTCC ATCTACCCAA GGCATCTGCA AGGTCTTCTC TACCTGTGGT 720 
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TCCCATCTCT CTCTACTATC ACTCTTCTAT GGCACAATTA TTCCTCTCTA CTTATCTCCA 780 

CCAGCTAATA ATTCCACTCT AAAACACATC CTCATGCCCA TGATGTACAC TCTGGTCACC 840 

CCCATGCTGA ATCCCTTCAT CTACAGCCTA AGGAATAGAG ATATGAACAG GGCCCTAATA 900 

AGACTTATCT GTAGTATCAA AATCACTCTG TAA 933 

(2) INFORMATION FOR SEQ ID NO1S1 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH : 904 baaa pairs 

(8) TYPEt nueUlc acid 

(C) STRAND ED HESS : aingU 

(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: YES 

<iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory •pith«liun 

<B) STRAIN: Srpagua-Dawlty rat 

(F) TISSUE TYPE: olfactory •pithalium 

<vll) IMMEDIATE SOURCE : 
(8) CLONE: 17 



(xi) SEQUENCE DESCRIPTION : SSQ ID NO:8: 



ATGGAGCGAA 


GGAACCACAG 


TGGGAGAGTG 


AGTGAATTTG 


TGTTGCTCCG 


TTTCCCAGCT 


60 


CCTGCCCCAC 


TGCGAGTACT 


ACTATTTTTC 


CTTTCTCTTC 


TGGACTATGT 


CTTGGTGTTG 


120 


ACTGAAAACA 


TGCTCATCAT 


TATAGCAATT 


AGGAACCACC 


CAACCCTCCA 


CAAACCCATG 


180 


TATTTTTTCT 


TGCCTAATAT 


CTCATTTCTG 


GAGATTTGGT 


ATGTCACTGT 


TACGATTCCT 


240 


AAGATGCTCG 


CTCCCTTCAT 


TCGTTCCAAG 


GAGAACCATC 


GACAGCTGAT 


CTCCTTTGAG 


300 


GCATCCATGA 


CACAACTCTA 


CTTTTTCCTG 


GGCTTGGGTT 


GCACAGAGTG 


TGTCCTTCTT 


360 


GCTGTGATGG 


CCTATGACCC 


CTATGTGGCT 


ATCTGTCATC 


CACTCCACTA 


CCCCGTCATT 


420 


GTCAGTAGCC 


GCCTATGTCT 


GCAGATGCCA 


CCTGGATCCT 


GGGCTGGAGC 


TTTTGGTATC 


480 


TCCATGGTTA 


AAGTTTTCCT 


TATTTCTCGC 


CTGTCTTACT 


GTGGCCCCAA 


CACCATCAAC 


540 


CACTTTTTCT 


GTGATCTGTC 


TCCATTGCTC 


AACCTCTCAT 


GCACTGACAT 


GTCCACAGCA 


600 


CAGCTTACAG 


ACTTTGTCCT 


GGCCATTTTT 


ATTCTGCTGG 


CACCGCTCTC 


TGTCACTGGG 


660 


GCATCCTACA 


TGGCCATCAC 


AGGTGCTGTC 


ATGCGCATCC 


CCTCAGCTGC 


TGGCCGCCAT 


720 


AAAGCCTTTT 


CAACCTCTGC 


CTCCCACCTC 


ACTGTTGTCA 


TCATCTTCTA 


TGCACCCAGT 


780 


ATTTTCATCT 


ATGCCAGGCC 


TAAGGCACTC 


TCAGCTTTTC 


ACACCAACAA 


CCTGCTCTCT 


840 


GTACTCTACG 


CTGTCATTGT 


ACCGTTGTTC 


AATCCCATCA 


TCTACTGCTT 


GCGCAACCAA 


900 
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GATGTCAAAA CACCCCTACC TOGCACGCTG CACCTGGCCC ACCACCACCA CCCCAATACC 960 
AACAAACCCA CCAAAATTCC TTAC 984 
(2) INFORMATION POR SEQ 10 NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 939 *>••• pmirm 

(B) TYPEj nucUic acid 

(C) STRANDSDNSSS t «ingU 
(0) TOPOLOGY t ilnur 

<li) MOLECULE TYPEi cONA 

(iii) HYPOTHETICAL: If IS 

(iv) ANTI-SENSE t NO 

(vl) ORIGINAL SOURCE : 

(A) ORGANISM: rat olfactory •pithaliua 

(B) STRAIN) Srpagua-Dawlay rat 

(P) TISSUE TYPE: olfactory •pithaliua 

<vit) IMMEDIATE SOURCE i 
(B) CLONE: 18 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9i 



ATGAACAACA 


AAACTGTCAT 


CACCCATTTC 


CTCCTCCTGC 


GATTGCCCAT 


CCCCCCAGAG 


60 


CACCAGCAAC 


TGTTCTTTGC 


ccTcrrccTG 


ATCATGTACC 


TCACCACCTT 


TCTGGGAAAC 


120 


CTCCTAATTC 


TTCTCCTTGT 


TCAACTGGAC 


TCTCATCTCC 


ACACACCCAT 


CTACTTGTTT 


160 


CTCAGCAACT 


TGTCCTTCTC 


TGATCTCTGC 


TTTTCCTCTG 


TTACAATGCT 


GAAATTGCTC 


240 


CAAAATATAC 


AGAGCCAAGT 


ACCATCTATA 


TCCTATGCAG 


GATGCCTCAC 


ACACATATTC 


300 


TTCTTTTTGT 


TGTTTGGCTA 


CCTTGGGAAT 


TTCCTTCTTG 


TAGCCATGCC 


CTATGACCGC 


360 


TATCTCCCCA 


TCTGCTTCCC 


TCTCCATTAT 


ACCAACATCA 


TGAGCCATAA 


GCTCTGTACT 


420 


TGTCTCCTGC 


TGCTATTTTG 


GATAATCACA 


TCATCTCATC 


CCATGATGCA 


CACCCTGCTT 


4S0 


GCAGCAAGAT 


TCTCTTTTTG 


TGACAACAAT 


GTACTCCTCA 


ACTTTTTCTG 


TGACCTGTTT 


S40 


GTTCTCCTAA 


AGTTGGCCTG 


CTCACACACT 


TATGTTAATG 


AGTTGATGAT 


ACATATCATG 


600 


GGCGTCATCA 


TCATTCTTAT 


TCCATTOGTC 


CTCATTGTTA 


TATCCTATGC 


CAAGATCATC 


660 


TCCTCCATTC 


TTAAGGTTCC 


ATCTACTCAA 


AGCATTCACA 


AGGTCTTCTC 


CACTTGTGCT 


720 


TCTCATCTCT 


CTGTGGTGTC 


TCTGTTCTAC 


GGCACAATTA 


TTGCTCTCTA 


TTTATGTCCA 


780 


TCAGCTGATA 


ATTTTAGTCT 


AAAGGGGTCT 


GCCATGCCTA 


TGATGTACAC 


AGTGGTAACT 


840 


CCAATCCTGA 


ACCCCTTCAT 


CTACAGCCTA 


AGAAACACAC 


ACATGAAGCA 


GGCCCTAATA 


900 


ACAGTTACCT 


GTAGCAAGAA 


AATCTCTCTC 


CCATGGTAG 






939 



(2) INFORMATION FOR SEQ ID NO: 10: 
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U) sequence characteristics! 

(A) LENGTH! 945 bat* pair* 

(B) TYPE i nucleic acid 

(C) STRANDEDNESS: tingle 
<D) TOPOLOGY; linear 

<ii) MOLECULE TYPE: CONA 

(lii) HYPOTHETICAL! YES 

(iv) ANTI-SENSE: NO 

(vl) ORIGINAL SOURCE! 

(A) ORGANISM i rat olfactory epithelium 

(B) STRAINt Srpague-Dawley rat 

(P) TISSUE TYPEt olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE : 19 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 10: 



ATGACTAGAA 


GAAACCAAAC 


TGCCATCTCT 


CAGTTCTTCC 


TTCTGGGCCT 


GCCATTCCCC 


60 


CCAGAGTACC 


AACACCTGTT 


CTATGCCCTG 


TTCCTGGCCA 


TGTACCTCAC 


CACTCTCCTG 


120 


GGGAACCTCA 


TCATCATCAT 


CCTCATTCTA 


CTGGACTCCC 


ATCTCCACAC 


ACCCATGTAC 


180 


TTGTTTCTCA 


GCAATTTATC 


CTTTGCCGAC 


CTCTGTTTTT 


CCTCTGTCAC 


AATGCCCAAG 


240 


TTGTTCCAGA 


ACATGCAGAG 


CCAACTTCCA 


TCCATCCCCT 


ATGCAGGGTC 


CCTGGCACAG 


300 


ATATACTTCT 


TTCTCTTTTT 


TGCAGACCTT 


GGAAACTTCC 


TGCTTCTGGC 


CATGGCCTAT 


360 


GACCGCTATG 


TGGCCATCTG 


CTTCCCCCTT 


CATTACATGA GCATCATGAG 


CCCCAAGCTC 


420 


TGTCTGAGTC 


TCGTGGTGCT 


CTCCTGGGTG 


CTGACTACCT 


TCCATCCCAT 


GCTCCACACC 


480 


CTGCTCATGG 


CCAGATTGTC 


ATTCTGTGAG 


GACACTGTGA 


TCCCTCACTA 


TTTCTGTGAT 


540 


ATGTCTACTC 


TGCTGAAAGT 


GCCTTGTTCT 


GACACCCATG 


ATAATGAATT 


AGCAATATTT 


600 


ATCTTAGCGC 


GCCCTATAGT 


TGTACTACCT 


TTCCTTCTCA 


TCATTGTTTC 


TTATGCAAGA 


660 


ATTGTTTCCT 


CCATCTTCAA 


GGTCCCTTCT 


TCTCAAAGCA 


TCCATAAACC 


CTTCTCCACC 


720 


TGTGGCTCCC 


ACCTGTCTGT 


GGTCTCACTG 


TTCTATGGGA 


CAGTCATTGC 


TCTCTACTTA 


780 


TGTCCTTCAG 


CTAATAACTC 


CACTGTCAAG 


GAGACTGTCA 


TGTCTTTCAT 


CTACACAATG 


840 


CTCACACCCA 


TGCTGAACCC 


CTTCATCTAC 


AGCCTAAGAA 


ACAGAGACAT 


AAAAGATGCA 


900 


TTAGAAAAA* 


TAATGTGCAA 


AAAGCAAATT 


CCCTCCTTTC 


TATCA 
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(2) INFORMATION FOR SEQ ID NO: 11 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64$ baa* pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPEi cDNA 
(ill) HYPOTHETICAL* YES 
(lv) ANTJ-SENSBl HO 

<vi) ORIGINAL SOURCE t 

(A) ORGANISMt hWDOsaplen 

(vlt) IMMEDIATE SOURCE i 

(B) CLONE i HS 

(lx) FEATURES 

(A) NAME/KEY: CDS 

(B) LOCATION i 1..64S 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: Hi 

ATC TGT TTT GTG TCT ACC ACT GTC CCA AAG CAG CTG CTG AAC ATC CAC 48 
lie Cys Phe Val Ser Thr Thr Val Pro Lys Gin Leu Val A»n lie Gin 

15 10 15 

ACA CAG AGC AG A GTC ATC ACC TAT CCA GAC TGC ATC ACC CAG ATG TGC 96 
Thr Gin Ser Arg Val lie Thr Tyr Ala Asp Cye Ut Thr Gin Met Cys 

20 25 30 

TTT TTT ATA CTC TTT GTA GTG TTG GAC AGC TTA CTC CTG ACT GTG ATC 144 
Phe Phe lie Leu Phe Val Val Leu Asp Ser Leu Leu Leu Thr Val Met 
35 40 45 

GCC TAT GAC CGG TTT CTC GCC ATC TCT CAC CCC CTG CAC TAC ACA GTC 192 
Ala Tyr Asp Arg. Phe Val Alt lis Cys His Pro Leu His Tyr Thr Val 
50 55 60 

ATT ATG AGC TCC TOO CTC TGT GGA CTG CTG GTT CTG GTC TCC TGC ATC 240 
lis Met Ser Ser Trp Leu Cys Gly Leu Leu Val Leu Vsl Ser Trp lis 
65 70 75 80 

GTG AGC ATC CTA TAT TCT CTC TTA CAA AGC ATA ATG GCA TTG CAG CTG 288 
Val ser lie Leu Tyr Ser Leu Leu Gin Ser lie Met Ale Leu Gin Leu 

85 90 95 

TCC TTC TCT ACA CAA CTG AAA ATC CCT CAA TTT TTC TCT GAA CTT AAT 336 
Ser Phe Cys Thr Glu Leu Lys lie Pro Gin Phe Phe Cys Glu Leu Asn 

100 105 110 

CAG GTC ATC CAC CTT GCC TGT TCC GAC ACT TTT ATT AAT GAC ATG ATG 384 
Gin Vsl lie His Leu Als Cys Ser Asp Thr Phe lie Asn Asp Met Met 

115 120 125 

ATG AAT TTT ACA AGT GTC CTC CTG GGT CGG GGA TGC CTC CCT GGA ATA 4 32 

Met Asn Phe Thr Ser Vsl Leu Leu Gly Gly Gly Cys Leu Ala Gly lie 
130 135 140 

TTT TAC TNN TAC TTT AAG ATA CTT TGT TGC ATA TGT TCG ATC TCA TCA 480 
Phe Tyr Xaa Tyr Phe Lys lie Leu Cys Cys lie Cys Ser lie Ser Ser 

145 150 155 160 

OCT CAG CGG ATG AAT AAA GCA CTT TCC ACC TGT GCA TCT CAC CTC TCA 528 
Ala Gin Gly Met Asn Lys Ala Leu Ser Thr Cys Ala Ser Hit Leu Ser 

165 170 175 
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CTT CTC TCC TTA TTT TAT TCT ACA CCC GTA OCT GTC TAC CTT ACT TCT 576 

Val Val Ser Lau Pha Tyr Cya Thr Gly Val cly Val Tyr Leu Ser Ser 

X80 185 190 

OCT CCA ACC CAT AAC TCA CTC TCA AAT OCT CCA CCC TCC CTC ATC TAC 624 
Ala Ala Thr Hit Aan Ser Leu Ser Aan Ala Ala Ala Ser Val Mat Tyr 

195 200 205 

ACT CTC CTC ACC TCC ATC CTC 645 
Thr Val Val Thr Ser Mat Leu 

210 215 

(2) INFORMATION FOR StQ ID NOil2: 

(1) SEQUENCE CHARACTERISTICS t 

(A) LENGTH i 215 amino aclde 

(B) TYPE i amino acid 
(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: protain 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

I la Cye Pha Val Sar Thr Thr Val Pro Lye Cln Leu Val Aan I la Cln 

15 10 15 

Thr Gin Sar Arg Val I la Thr Tyr Ala Aep Cya I la Thr Cln Mat Cya 

20 25 30 

Pha Pha I la Lau Pha Val Val Lau Aap Sar Lau Lau Lau Thr Val Mat 

35 40 45 

Ala Tyr Aap Arg Pha Val Ala I la Cya Hia Pro Lau Hia Tyr Thr Val 

50 55 60 

I la Mat Sar Sar Trp Lau Cya Gly Lau Lau Val Lau Val Sar Trp I la 

65 70 75 80 

Val Sar I la Lau Tyr Sar Lau Lau Gin Sar lie Mat Ala Lau Cln Lau 

85 90 95 

Sar Pha Cya Thr Glu Lau Lya I la Pro Cln Pha Pha Cya Clu Lau Aan 

100 105 110 

Gin Val lie Hia Lau Ala Cya Sar Aap Thr Pha I la Aan Aap Mat Mat 

115 120 125 

Met Aan Pha Thr Ser Val Leu Lau Cly Cly Cly Cya Lau Ala Cly lie 

130 135 140 

Pha Tyr Xaa Tyr Pha Lya lie Leu Cya Cya lie Cya Ser lie Sar Sar 

145 150 155 160 

Ala Cln Gly Mat Aan Lya Ala Leu Ser Thr Cya Ala Ser Hia Leu Ser 

165 170 175 

Val Val Sar Leu Pha Tyr Cyi Thr Cly Val Cly Val Tyr Leu Ser Sar 

180 185 190 

Ala Ala Thr His Aan Ser Leu Ser Aan Ala Ala Ala Ser Val Met Tyr 

195 200 205 
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Thr Val Val Thr Ser Mst Leu 

210 21S 

(2) INFORMATION FOR SEQ ID NOil3: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 640 base pairs 

(B) TYPE * nucleic acid 

(C) STRANDEDNESSi single 

(D ) TOPOLOGY i linear 

(11) MOLECULE TYPE i cONA 
(ill) HYPOTHETICAL I YES 
(iv) ANT I -SENSE j NO 

(vl) ORIGINAL SOURCE : 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN: Srpague-Dawley rat 

if) TISSUE TYPE: olfactory epithelium 

<vii) IMMEDIATE SOURCES 
<B) CLONE: Jl 

(ix) FEATURE : 

(A) NAME/KEY: COS 
(0) LOCATION: 2.. 640 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 1 

C ATC TCC TTT ACT TCT GOT AGC ATC CCA AAC ATG CTA GTG AAT ATA 46 
lie Cys Phe Thr Ser Ala Ser lie Pro Lya Met Leu Val Aan lie 

15 10 15 

CAG ACQ AAG AAC AAG GTG ATC ACC TAT GAA GCC TGC ATC TCC CAA GTA 94 
Gin Thr Lya Aen Lya Val He Thr Tyr Glu Gly Cye He Ser Gin Val 

20 25 30 

TAC TTT TCA TAC TCT TTG GAG TTT TOG ACA ACT TTC TTC TOG ACT GTG 142 
Tyr Phe Ser Tyr Ser Leu Glu Phe Trp Thr Thr Phe Phe Ser Thr Val 

35 40 45 

ATG GCC TAT GAC CCA TAT GTG GCC ATC TCT CAC CCA TCT NAC TAC ACA 190 
Met Ala Tyr Asp Arg Tyr Val Ala He Cye His Pro Ser Xaa Tyr Thr 

50 55 60 

GGT CAT CAT GAA CCN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 238 
Gly His His Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 75 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 286 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

60 85 90 95 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 334 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN 382 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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115 120 125 

NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NNN NTT 430 
Xaa Xaa Xaa X44 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

130 135 140 

TAT TCT TAC TCT AAC ATA CTT TCC TCC ATA OCA GAA ATC TCA TCA TCA 478 
Tyr Sar Tyr Sar Lyi I la Val Sar 5ar I la Arg Clu Ila Sar Sar Sar 

145 ISO 155 

CAC CCA AAC TAC AAC NNA TTC TCC ACC TCT CCA TCC CAC CTC TCA CTT 526 
Cln Cly Lya Tyr Lyi Xaa Pha Sar Thr Cya Ala Sac Hia Lau Sar Val 

160 165 170 175 

CTT TCA TTA TTC TAT TCT ACA CTT TTC OCT CTC TAC CTT ACT TCT TCT 574 
Val Sar Lau Pha Tyr Sar Thr Lau Lau Cly Val Tyr Lau Sar Sar Sar 

160 165 190 

TTT ACC CAA AAC TCA CAC TCA ACT CCA CCC CCA TCT CTT ATC TAC ACT 622 
Pha Thr Cln Aan Sar Hia Sar Thr Ala Arg Ala Sar Val Mat Tyr Sar 

195 200 205 

GTC CTC ACC CCC ATC TTG 640 
Val Val Thr Pro Hat Lau 
210 

(2) INFORMATION FOR SEQ ID NO: 14: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acida 

(B) TYPE; amino acid 
f D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: protain 

<xi) SEQUENCE DESCRIPTION} SEQ ID NO: 14: 

Ila Cya Pha Thr Sar Ala Sar Ila Pro Lya Mat Lau Val Aan Ila Gin 

15 10 IS 

Thr Lya Aan Lya Val Ila Thr Tyr Glu Cly Cya Ila Sar Cln Val Tyr 

20 25 30 

Pha Sar Tyr Sar Lau Glu Pha Trp Thr Thr Pha Pha Sar Thr Val Mat 

35 40 45 

Ala Tyr Aap Arg Tyr Val Ala Ila Cya Hia Pro Sar Xaa Tyr Thr Gly 

50 SS 60 

Hia Hia clu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

115 120 125 
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X** Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr 

130 135 140 

Sar Tyr Sar Lya Ila Val Sar Sar I la Arg Glu I la sar Sar Sar Gin 

145 150 155 160 

Gly Lya Tyr Lya Xaa Pha Sar Thr Cya Ala Sar Hla Lau Sar Val Val 

165 170 175 

Sar Lau Pha Tyr Sar Thr Lau Lau Gly Val Tyr Lau Sar Sar Sar Pha 

180 185 190 

Thr Gin Aan Sar Hia Sar Thr Ala Arg Ala Sar Val Mat Tyr Sar Val 

195 200 205 

Val Thr Pro Mat Lau 

210 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 636 baaa paira 

(B) TYPE: nuclaic acid 

(C) STRANOEDNESS : a ingle 
<D> TOPOLOGY: linaar 

(ii) MOLECULE TYPEi protein 

(iii) HYPOTHETICAL : YES 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE t 

(A) ORGANISM: rat olfactory epithelium 

(B) STRAIN i erpague-Dawley rat 

(FJ TISSUE TYPE: olfactory apithalium 

(vii) IMMEDIATE SOURCE: 
(B) CLONES J2 

<ix> FEATURE t 

(A) NAME /KEY: CDS 

(B) LOCATION i 1..636 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ACC TCC ACC ACC ATC CCA AAG ATG CTC GTA AAT ATA CAC ACC CAG ACC 48 
Thr Sar Thr Thr Ila Pro Lya Mat Lau VaX Aan Ila Hia Thr Gin Sar 

15 10 15 

AAT ACT ATC ACC TAT GAA CAC TGT ATT TCC CAG ATG TTT GTA CTC TTG 96 
Aan Thr Ila Thr Tyr Glu Aap Cya Ila Sar Gin Mat Pha Val Lau Lau 

20 25 30 

GTT TTT GO A GAA CTC CAC AAC TTT CTC CTC OCT CTC ATC GCC TAT GAT 144 
Val Pha Gly Glu Lau Aap Aan Pha Lau Lau Ala V«l Mat Ala Tyr Aap 
35 40 45 

CGA TAT GTC OCT ATC TGT CAC CCA CTC TAT TAC ACA GTC ATT GTG AAC 192 
Arg Tyr Val Ala He Cya Hia Pro Leu Tyr Tyr Thr Val Ila Val Aan 

50 55 60 
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CAC CCA CTC TCT ATC CTC CTC CTT CTG CTG TCC TGG CTT CTC ACC ATT 240 
Hi* Arg Lau Cy* Ila Lau Lau Lau Lau Lau Sar Trp Val Val sar 21* 

65 70 75 80 

TTA CAT GCC TTC TTA CAO AGO TTA ATT GTA CTA CAC TTG ACC TTC TCT 288 
Lau Hi* Ala Pha Lau Gin Sar Lau II* Val Lau Gin Lau Thr Pha Cya 

85 90 95 

GGA GAT GTG AAA ATC CCT CAC TTC TTC TGT GAG CTC AAT CAC CTG TCC 336 
Gly Asp Val Ly* II* Pro HI* Pha Pha Cya Glu Lau Aan Gin Lau Sar 

100 105 110 

CAA CTC ACA TGT TCA GAC AAC TTT CCA AGT CAC CTC ACA ATC CAT CTT 384 
Gin Lau Thr Cya Sar Acp A*n Pha Pro Sar HI* Lau Thr Mat Hia Lau 

115 120 125 

GTA CCT GTT ATA TTT GCA OCT ATT TCC CTC AGT GOT ATC CTT TAC TCT 432 
Val Pro Val Ila Pha Ala Ala Ila Sar Lau Sar Gly II* Lau Tyr Sar 

130 135 140 

TAT TTC AAG ATA GTG TCT TCC ATA CGT TCT ATG TCC TCA CTT CAA GGG 480 
Tyr Pha Ly* Ila Val Sar Sar Ila Arg Sar Met Sar Sar Val Gin Gly 

145 150 155 160 

AAG TAC AAG GCA TTT TCT ACA TCT GCC TCT CAC CTT TCC ATT CTC TCC 528 
Ly* Tyr Lya Ala Pha Sar Thr Cy* Ala Sar Hia Lau Sar Ila Val Sar 

165 170 175 

TTA TTT TAT AGT ACA GCC CTC GGG GTG TAC GTC AGT TCT CCT CTC ATC 576 
Lau Pha Tyr Sar Thr Gly Lau Gly Val Tyr Val Sar Sar Ala Val Xla 

160 185 190 

CCA ACC TCA CAC TCC TCT CCA ACT CCT TCC CTC ATC TAT ACT GTC GTC 624 
Arg Sar Sar Hia Sar Sar Ala Sar Ala Sar Val Mat Tyr Thr Val Val 

195 200 205 

ACC CCC ATG TTC 636 
Thr Pro Hat Lau 

210 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 212 amino acid* 
(0) TYPE: amino acid 
(0) TOPOLOGY t linaar 

(ii) MOLECULE TYPE 1 protain 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Thr Ser Thr Thr Ila Pro Lya Mat Lau Val A*n II* Hia Thr Gin Sar 

15 10 15 

Aan Thr Ila Thr Tyr Glu A*p Cy* Ila Sar Gin Mat Pha Val Lau Lau 

20 25 30 

Val Pha Gly Glu Leu Aap Aan Pha Lau Lau Ala Val Mat Ala Tyr Aap 

35 40 45 

hrg Tyr Val Ala Ila Cya Hie Pro Lau Tyr Tyr Thr Val Ila Val A*n 
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50 55 60 

Hie Arg Leu Cye He Leu Leu Leu Leu Leu Ser Trp Val Val Ser II* 

65 70 75 80 

Leu His Ala Phe Leu Cln Ser Leu He Val Leu Gin Leu Thr Phe Cye 

85 90 95 

Cly Aep Val Lya He Pro Hie Phe Phe Cye Clu Leu Aen Cln Leu Ser 

100 105 HQ 

Cln Leu Thr Cye Ser Asp Aen Phe pro Ser Hie Leu Thr Met Hie Leu 

115 120 125 

Val Pro Vel He Phe Ale Al* He Ser Leu Ser Cly He Leu Tyr Ser 

130 13& 140 

Tyr Phe Lye He Val Ser Ser He Arg Ser Met Ser Ser Val Cln Cly 

145 ISO 155 160 

Lye Tyr Lye Ale Phe Ser Thr Cye Ala Ser Hie Leu Ser He Vel Ser 

165 170 175 

Leu Phe Tyr Ser Thr Cly Leu Cly Val Tyr Val Ser Ser Ala Val He 

180 185 190 

Arg Ser Ser His Ser Ser Ala Ser Ala Ser Val Met Tyr Thr Vel Val 

195 200 205 

Thr Pro Met Leu 

210 

(2) INFORMATION TOR SEQ ID NOtl7: 

<i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 646 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS i single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
<Ui) HYPOTHETICAL: YES 
(iv) ANT I -SENSE: NO 

<vl> ORIGINAL SOURCE : 

(A) ORCAMISM: rat olfactory epithelium 

(B) STRAIN i erpague-Dawley rat 

(F) TISSUE TYPE: olfactory epithelium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE i J4 

(ix) FEATURE: 

(A) NAME/KEY: COS 
{ B) LOCATION: 2.. 646 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
C ATA CCC TAT TCA TCT TCT CTC ACA CCC AAT A TO CTT CTC AAC TTC 46 
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Il« Gly Tyr Sar Sar Sar Val Thr Pro Aan M«t Lau Val Am Pha 

IS 10 15 

CTT ATA AAC CAA AAT ACC ATC TCA TAC CTT CCA TCT TCT ATA CAC TTT 94 
Lau Ila Lya Gin Aan Thr lit Sar Tyr L*u Gly Cya S«r Ila Gin Pha 

20 25 30 

CCC TCA OCT CCT TTC TTT CCA CCT CTT CAA TCC TTC CTT CTC OCT CCC 142 
Cly Sar All Ala Lau Pha Cly Gly Lau Clu Cya Pha Lau Lau Ala Ala 

35 40 45 

ATG GCG TAT CAT CCT TTT CTA GCA ATC TCC AAC CCA CTC CTT TAT TCA 190 
Nat Ala Tyr Aap Acq Pha Val Ala Ila Cya Aan Pro Lau Lau Tyr Sar 

50 55 60 

ACC AAA ATC TCC ACA CAA CTC TCT CTC CAC TTC CTT CTC CCA TCT TAT 238 
Thr Lya Mat Sar Thr Gin Val Cya Val Gin Lau Val Val Cly Sar Tyr 

65 70 75 

ATA GCG CCA TTT CTT AAT GCC TCC TCT TTT ACC CTT TCC TTT TTT TCC 286 
Ila Cly Cly Pha Lau Aan Ala Sar Sar Pha Thr Lau Sar Pha Pha Sar 

80 85 90 95 

TTC TCC TTC TCT CCA CCA AAT ACA ATC AAT CAC TTT TAC TCT CAT TTT 334 
Lau Sar Pha Cya Gly pro Aan Arc, Ila Aan Hi. a Pha Tyr Cya Aap Pha 

100 105 110 

CCT CCC TTA CTA CAA CTT TCT TCC TCT CAT GTC AGT CTT CCT GAT CCT 382 
Ala Pro Lau Val Clu Lau Sar Cya Sar Aap Val Sar Val Pro Aap Ala 

115 120 125 

CTT ACC TCA TTT TCT CCT CCC TCA CTT ACT ATC CTC ACA CTC TTT ATC 430 
Val Thr Sar Pha Sar Ala Ala Sar Val Thr Hat Lau Thr Val Pha Ila 
130 135 140 

ATA GCC ATC TCC TAT ACC TAT ATC CTC ATC ACC ATC CTC AAC ATG CCT 478 
Ila Ala Ila Sar Tyr Thr Tyr Ila Lau Ila Thr Ila Lau Lya Mat Aro 
145 150 155 

TCC ACT CAC CCT CCA CAC AAA GCA TTC TCT ACC TCC ACT TCC CAC CTC 526 
Sar Thr Clu Cly Arg Cln Lya Ala Pha Sar Thr Cya Thr Sar Hia Lau 
160 165 170 175 

ACT CCA CTC ACT CTC TCC TAT GCA ACC ATC ACA TTC ATC TAT GTC ATC 574 
Thr Ala Val Thr Lau Cya Tyr Cly Thr Ila Thr Pha IU Tyr Val Mat 

180 185 190 

CCC AAC TCC ACC TAC TCC ACA CAC CAC AAC AAG CTC CTG TCT GTC TTT 622 
Pro Lya Sar Sar Tyr Sar Thr Aap Cln Aan Lya Val Val Sar Val Pha 

195 200 205 

TAT ATC CTG GTC ATC CCC ATC TTC 646 
Tyr hat Val Val XI* Pro Mat Lau 

210 215 

(2) INFORMATION FOR SZQ ID NO: 18 i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY: linaar 
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(U) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION; SEQ 10 NO; 18: 

Ila Gly Tyr S«r 5«r Sar Val The Pro Aan Nat Lau V*l Aan Pha Lau 

15 10 15 

Ila Lya Gin Aan Thr lis Sar Tyr Leu Gly Cya Sar Ila Gin Pha Gly 

20 25 30 

Sar Ala Ala Lau Pha Gly Gly Lau Clu Cy» Pha Lau Lau Ala Ala Mat 

35 40 45 

Ala Tyr Asp Arg Pha Val Ala I la Cys Aan Pro Lau Lau Tyr Sar Thr 

50 55 60 

Lye Nat Sar Thr Gin Val Cya Val Gin Lau Val Val Gly Sar Tyr Ila 

65 70 75 80 

Gly Gly Pha Lau Aan Ala Sar Sar Pha Thr Lau Sar Pha Pha Sar Lau 

85 90 95 

Sar Pha Cya Gly Pro Aan Arg Ila Aan Hla Pha Tyr Cya Aap Pha Ala 

100 10S 110 

Pro Lau Val Glu Lau Sar Cya Sar Aap Val Sar Val Pro Aap Ala Val 

115 120 125 

Thr Sar Pha Sar Ala Ala Sar Val Thr Mat Lau Thr Val Pha Zla Ila 

130 135 140 

Ala Ila Sar Tyr Thr Tyr Ila Lau Ila Thr Ila Lau Lya Mat Arg Sar 

14S 150 155 160 

Thr Glu Gly Arg Gin Lya Ala Pha Sar Thr Cya Thr Sar Hia Lau Thr 

165 170 17S 

Ala Val Thr Lau Cya Tyr Gly Thr Ila Thr Pha Ila Tyr Val Mat Pro 

180 185 190 

Lya Sar Sar Tyr Sar Thr Aap Gin Aan Lya Val Val Sar Val Pha Tyr 

195 200 205 

Mat Val Val Ila Pro Mat Lau 

210 215 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 481 baaa paira 

(B) TYPEi nuclaic acid 

(C) STRAN DE ONE SS i aingla 

(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: CDNA 

<Ui> HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory apithaliun 
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(B) STRAIN: Srpagua-Dawley rat 

(T) TISSUE TYPI; o 1 factory apithalium 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: J7 

(lx> FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2.. 481 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOi19i 

C ATC TGC AAC CCC CTG CAC TAC ACC ACC ATC ATG AAT AAC CCA CTG 46 
Il« Cya Ly* Pro Lau KLa Tyr Thr Thr I la Mat Aan Aan Arg Val 

15 10 15 

TGC ACA GTT CTA CTC CTC TCC TCT TGC TTT OCT GGC CTG TTG ATC ATC 94 
Cya Thr Val Lau Val Lau Ser Cya Trp Pha Ala Gly L#u Lau Ila I la 

20 25 30 

CTC CCA CCT CTT GGT CAT GGC CTC CAG CTG GAG TTC TGT GAC TCC AAT 142 
Lau Pro Pro Lau Gly Hia Gly Lau Gin Lau Glu Pha Cya Aap Sar Aan 

35 40 45 

CTG ATT GAT CAT TTT GGC TCT GAT CCC TCT CCA ATT CTC CAG ATA ACC 190 
Val Ila Aap Hia Pha Gly Cya Aap Ala Sar Pro Ila Lau Gin Ila Thr 

50 55 60 

TCC TCA CAC ACC CTA TTT ATA CAG AAA ATT GTC TTC CCT TTT GCC ATA 238 
Cya Sar Aap Thr Val Pha Ila Glu Lya Ila Val Lau Ala Pha Ala Ila 

65 70 75 

CTC ACA CTC ATC ATT ACT CTC CTA TCT CTT CTT CTC TCC TAC ACA TAC 286 
Lau Thr Lau Ila Ila Thr Lau Val Cya Val Val Lau Sar Tyr Thr Tyr 

80 85 90 95 

ATC ATC AAC ACC ATT TTA AAC TTT CCT TCT CCT CAA CAA ACA AAA AAC 334 
Ila Ila Lya Thr Ila Lau Lya Pha Pro Sar Ala Gin Gin Arg Lya Lya 

100 105 110 

CCC TTT TCT ACA TCT TCT TCC CAC ATG ATT CTC CTT TCC ATC ACC TAT 382 
Ala Pha Sar Thr Cya Sar Sar Hia Mat Ila Val val Sar Ila Thr Tyr 

115 120 125 

CCC AGC TGT ATT TTC ATC TAC ATC AAA CCT TCA GCC AAC CAA GGC CTA 430 
Gly Sar Cya Ila Pha Ila Tyr Ila Lya Pro Sar Ala Lya Glu Gly Val 

130 135 140 

GCC ATC AAT AAC CTT CTA TCT GTC CTC ACA ACA TCA CTC CCC CCT TTC 478 
Ala Ila Aan Lya Val Val Sar Val Lau Thr Thr Sar Val Ala Pro Lau 

145 150 155 

CTC 481 
Lau 

160 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 160 amino acida 
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(B) TYPE: unino acid 
(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE i protain 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Ila Cya Lya Pro Lau Hia Tyr Thr Thr Ila Mat Aan Aan Arc. Val Cya 
1 5 .10 15 

Thr Val Lau Val Lau Sar Cya Trp Pha Ala Cly Lau Lau Ila 11m Lau 

20 25 30 

Pro Pro Lau Cly Hia Gly Lau Gin Lau Glu Pha Cya Aap Sar Aan Val 
3S 40 45 

Xla Aap Hia Pha Gly Cya Aap Ala Sar Pro Ila Lau Gin I la Thr Cya 

50 55 60 

Sar Aap Thr Val Pha Ila Glu Lya Ila Val Lau Ala Pha Ala Ila Lau 

65 70 75 80 

Thr Lau tla Ila Thr Lau Val Cya Val Val Lau Sar Tyr Thr Tyr Ila 

85 90 95 

Ila Lya Thr Ila Lau Lya Pha Pro Sar Ala Gin Gin Arc, Lya Lya Ala 

100 105 110 

Ph« Sar Thr Cya Sar Sar Hia Mat Ila Val Val Sar Ila Thr Tyr Gly 

115 120 125 

Sar Cya Ila Pha Ila Tyr Ila Lya Pro Sar Ala Lya Glu Gly Val Ala 

130 135 140 

Ila Aan Lya Val Val Sar Val Lau Thr Thr Sar Val Ala Pro Lau Lau 

145 150 155 160 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 481 baaa pair a 

(B) TYPEt nuclaic acid 

(C) STRANDEDNESS i aingla 

(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE J protain 
(lii) HYPOTHETICAL t YES 
(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat olfactory apithalium 

(B) STRAIN: Srpagua-Dawlay rat 

(F) TISSUE TYPE: olfactory apithalium 

(vii) IMMEDIATE SOURCE : 
{B) CLONE: J8 

<ix) FEATURE: 

(A) NAME /KEY; CDS 
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(B) LOCATION: 2.. 481 

(xi) SEQUENCE DESCRIPTIONS SEQ 10 NOi21i 

C ATC TCC CAC CCC CTC CAC TAC TCT CTT CTC A TO AOT CCT CAC AAC 46 
Ila Cya Hi* Pro Latu His Tyr Sar Lau Lau Hat Sar Pro Aap Aan 

15 10 15 

TGT CCT CCT CTC GTA ACA CTC TCC TCC CTC ACA CCC CTC CCC ACC CCC 94 
Cya Ala Ala L«u Val Thr Val Sar Trp Val Thr Cly Val Cly Thr Gly 

20 25 30 

TTC CTC CCT TCC CTC CTC ATT TCT AAC rPC CAC TTC TCT CCC CCC AAC 142 
Ph« Lau Pro Sar Lam La>u Xia Sar Lys Lau Aap Pha Cy» Cly Pro Aan 

35 40 45 

CCC ATC AAC CAT TTC TTC TCT CAC CTC CCT CCA TTA ATC CAC CTC TCC 190 
Arg Ila A«n Hia Ph« Pha Cya Aap Lau Pro Pro L«u I la cin Lau Sar 

50 55 60 

TCC TCC ACC GTC TTT CTC ACA GAA ATC GCC ATC TTT CTC CTC TCC ATC 238 
Cya Sar Sar Val Pha Val Thr Clu Mat Ala Ila Pha Val Lau Sar Ila 

65 70 75 

CCT CTC CTC TCC ATC TCT TTC CTC CTA ACC CNN NNN TCC TAC ATT TTC 286 
Ala Val Lau Cya Ila Cya Pha Lau Lau Thr Xaa Xaa Sar Tyr XI* Pha 
80 85 90 95 

ATA CTC TCC TCC ATT CTC ACA ATC CCT TCC ACT ACC CCC ACC ATC AAC 334 
Ila Val Sar Sar Ila Lau Arg Ila Pro Sar Thr Thr Gly Arg Mat Lya 

100 105 110 

ACA TTT TCT ACA TGT CCC TCC CAC CTC CCC CTC CTC ACC ATC TAC TAT 382 
Thr Pha Sar Thr Cya Cly Smr Hia Lau Ala Val Val Thr Ila Tyr Tyr 

115 120 125 

CCC ACC ATC ATC TCC ATC TAT GTC CCC CCA AAT CCC CAT CTC TCC CCC 430 
Gly Thr Hat Ila Sar Mat Tyr Val Cly Pro Aan Ala Hia Lau Sar Pro 
130 135 140 

GAG CTC AAC AAC CTC ATT TCT CTC TTC TAC ACT CTC ATC ACC CCA CTA 478 
Clu Lau Aan Lya Val Ila Sar Val Pha Tyr Thr Val Ila Thr Pro Lau 
145 ISO 155 

CTC 481 
Lau 

160 

(2) INFORMATION FOR SEQ ID NO: 22 i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 160 amino acida 
<5) TYPE: amino acid 
(D> TOPOLOGY: linaar 

Ui) MOLECULE TYPE: protain 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 22: 

Ila Cya Kia Pro Lau Hia Tyr Sar Lau Lau Hat Sar Pro Aap Aan Cya 
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